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Abstract 

In the stochastic network model of Britton and Lindholm [Dynamic random networks in dynamic 
populations. Journal of Statistical Physics, 2010], the number of individuals evolves according to 
a linear birth and death process with per-capita birth rate A and per-capita death rate fi < X. 
A random social index is assigned to each individual at birth, which controls the rate at which 
connections to other individuals are created. Britton and Lindholm give a somewhat rough proof 
for the convergence of the degree distribution in this model towards a mixed Poisson distribution. 
We derive a rate for this convergence giving precise arguments. In order to do so, we deduce 
the degree distribution at finite time and and derive an approximation result for mixed Poisson 
distributions to compute an upper bound for the total variation distance to the asymptotic degree 
distribution. We prove several general results about linear birth and death processes along the way. 
Most notably, we derive the age distribution of an individual picked uniformly at random at some 
finite time. 
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1 Introduction 

“Network Science” is a relatively young, rapidly growing research area dealing with complex systems 
with an underlying graph structure (see e.g. the recent book by van der Hofstad [vdH15]). One of 
the first random graph models is the well-known Erdos-R&yi model, which is a static model, i.e. 
it describes a random network at a fixed time. Although this model shows interesting behaviour, 
further random graph models were needed for the description of real networks since some empirical 
networks have important properties that are not represented by Erdds-Renyi Graphs. One of these 
is the power law property of the degree distribution. Preferential attachment models exhibit such a 
power law behaviour asymptotically and were popularized by Barabasi and Albert [BA99]. Pekoz, 
Rollin and Ross [PRR13] established convergence rates for the degree distribution in a discrete-time 
preferential attachment model in terms of the total variation distance. An alternative to preferential 
attachment models is the time-continuous random graph model that was introduced by Britton and 
Lindholm [BLIO] and that we consider here. The main difference between this model and the pref¬ 
erential attachment models is that in the former the popularity of each node is determined at birth 
and stays the same for the whole lifetime whereas the popularity in preferential attachment models is 
proportional to the degree of the node, which changes over time. Depending on the application either 
model may be more realistic. In particular it can be shown that the degrees of nodes in the model by 
Britton and Lindholm can exhibit power laws, such that this model provides an interesting alternative 
for producing graphs with this property. The fact that time is considered continuously and nodes and 
connections can not only be created but also destroyed again makes the time-evolution more realistic 
than in the usual stepwise preferential attachment models. 

Let us give a loop-free version of the definition of the original dynamic network model by Britton 
and Lindholm] see [BLIO] and [BLTlIj: 

We examine a finite undirected graph without loops that develops over time. The node process (Tt)t>o 
is a linear birth and death process with initial value one. Thus each node gives birth at constant rate 
A and dies at constant rate p. independently from other nodes. We assume that (lt)t>o has right- 
continuous trajectories. The process {Yt)t>o and all other random variables that are defined in what 
follows to describe the dynamic random graph are defined on a common underlying probability space 

{n,A,F). 

We assume X > fi, so that the node process (Lj)t>o is a supercritical continuous-time Markov 
branching process. From standard branching process theory, we obtain that for a random variable Wt 
with C{Wt) = C{Yte~^^^~^^^Yt > 0), we have 


Wt^W a.s., 

where W is Exp(^^) distributed (see e.g. [Har50], page 319). 
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We equip every node i with a positive random social index S'j, where the Si are i.i.d. with finite 
expectation and independent of all other random variables. 

This allows us to define the development of the edge set. At birth every node is isolated. During its 
lifetime and as long as there is at least one other node, node i generates and destroys edges according 
to a birth and death process with constant birth rate aSi and per-edge death rate j3. Here a and /3 
are positive constants. The “second” node of each newly born edge is chosen uniformly at random 
from the set of all other living nodes, and all of the edge processes (including the choices of the second 
nodes) are independent of each other and all other events. 

In addition to the direct destruction of edges in the above process, all edges connected to a certain 
node are removed when this node dies. 

1.1 Remark 

The only difference to the definition by Britton and Lindholm is that we do not allow loops because 
these are not present in most applications. Note that the proofs become slightly simpler if we use 
the original model by Britton and Lindholm, essentially because times where Yt = 1 need not obtain 
special treatment. The upper bounds remain largely the same; see also Remark 5.3 for the pure birth 
case. 

Note that we still allow multiple edges. It can be shown that those are negligible in the sense that 
the probability that a randomly picked node has at least one multiple edge converges to zero at an 
exponential rate (see Section 6). This allows us to formulate the main result also for the case where 
we ignore multiple edges (see Corollary 1.4 below). 

We refer to the distribution of the number of edges incident to a node picked uniformly at random 
from all living nodes at time t given the number of nodes is positive as degree distribution, and denote it 
by ut- In [BLIO], Britton and Lindholm give a somewhat rough proof for the convergence of the degree 
distribution in the original model towards a mixed Poisson distribution u. It is the main purpose of 
this paper to give a rate for the convergence in total variation distance of nt to this asymptotic degree 
distribution n giving precise arguments. The distribution u is given by 

= MixPo {S + HS)) (1 - , 

where A ~ Exp(A), S has the social index distribution, and A and S are independent. Here MixPo 
denotes the mixed Poisson distribution. The following result is an immediate consequence of Theo¬ 
rems 5.2 and 5.4 below, which are proved in this paper. 

1.2 Theorem 

Let E(S'^) < oo. Then we obtain for the degree distribution ut in the Britton-Lindholm Model without 
loops 

(a) if ^ = 0, then dryii't, z^) = 0 [^/ie~ 2 ^^'j as t —)■ oo; 

(b) if ^ > 0, then = 0{t‘^e~&^^~^^^) as t —)■ oo. 

1.3 Remark 

A positive death rate n leads to a higher variability in the degree distributions for finite t since in 
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particular the death of a highly connected node (hub) can have a large impact. Thus we would not 
expect the same rate as in the pure birth case. However, the actual factor in the exponential rate in 
(b) may be larger than the one stated in the theorem. 

Theorem 1.2 has consequences for the case where we ignore multiple edges. Let vt be the distribution 
of the number of neighbours of a node picked uniformly at random from all living nodes at time t 
given the number of nodes at time t is positive. We refer to vt shortly as distribution of the number 
of neighbours. The convergence of this distribution to the asymptotic degree distribution u is an 
immediate consequence of the following corollary, which is proved in Section 6. 

1.4 Corollary 

Let E(5'^) < oo. For the distribution of the number of neighbours in the Britton-Lindholm Model 
without loops, we have that as t ^ oo. 

1.5 Remark 

Britton and Lindholm also introduced a modified model where the “second” node of each newly 
born edge is not picked uniformly. Instead the probability for each node being the “second” node is 
proportional to its social index. We expect that one can prove similar results for this modified model 
in an analogous way. Note that the modification typically leads to a substantially higher ratio of 
multiple edges, which makes the modified version less interesting for applications. 

The rest of the paper is organized as follows. In the second section, we use Poisson approximation to 
obtain an upper bound for the total variation distance between two mixed Poisson distributions, which 
we need in Section 5. In Section 3, we consider distributions related to linear birth and death processes 
since the node population is described by such a process. In particular, we derive the age distribution 
for a linear birth and death process using results about contour processes from [BPS12]. In Section 4, 
we derive the degree distribution at finite time by using results about birth and death processes. 
Finally, in Section 5, we derive rates for the convergence to the asymptotic degree distribution in 
terms of the total variation distance treating the pure birth case and the general case separately, 
which leads to Theorem 1.2 above. 


2 Upper bound for the total variation distance between two mixed 
Poisson distributions 

Theorem 2.1 in [Yan91] gives us an upper bound for the total variation distance between two Poisson 
distributions. By conditioning we can generalize this result to mixed Poisson distributions. 

2.1 Theorem 

Let A and M be positive real valued random variables. Then we have for the total variation distance 
between the mixed Poisson distributions MixPo(A) and MixPo(M): 

(iry(MixPo(A), MixPo(M)) < E(min(|\/A - \/M|, |A - M|). 
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Proof: Let X and Y be MixPo(A) and MixPo(M) distributed, respectively. Then it follows 

dTV'(MixPo(A),MixPo(M)) = sup \F{X G A) - P(y G A)\ 

AcNo 

= sup |E(P(X G A|A)) - E(P(y G A|M))| 

AcNo 

< sup E(|P(X G A|A) - P(y G A|M)|) 

AcNo 

< e( sup |(P(X G A|A)) - (P(y G A|M))| ) 

VAcNo / 

= E(dTV'(/:(^|A),£(y|M)) 

< E(min(|\/A - Vm\, |A - M|), 

where the last line follows from Theorem 2.1 in [Yan91]. □ 

2.2 Remark 

Note that the mixed Poisson distribution depends on the parameter random variable only via its 
distribution; therefore Theorem 2.1 yields 

(MixPo(A), MixPo(M)) < inf E(min(|A 2 — |, |A — M|). 

A:A=A 

M:M=M 


3 Distributions related to linear birth and death processes 


3.1 Total population size 

The node process iYt)t>o is a linear birth and death process with birth rate A, death rate /j, < X and 
initial value one, i.e. Yq = 1. Since some results could also be of general interest, we also consider the 
case where /x > A > 0. Firstly, we assume that the birth rate A and the death rate /x of iYt)t>o are not 
equal. According to (8.15) and (8.46) in [Bai64], the one-dimensional distributions of such a process 
are given by the following probability mass functions; 


where 

For A = 


Po{t) = fipit), 

Pn{f) = (1 - /ip(t))(l - Xp{f)){Xp{t)Y~^, n > 1, 

- 1 1 1 - e-A-M)t 

^ ^ ■“ Xei^-Yjt _ xi_ • 

p > 0, the probability mass function is given by 


Po{t) 

Pn{t) 


Xt 

1 -h At’ 
(At)^-i 
(1 Xt)^+^ ’ 


rx > 1, 


by (8.53) in [Bai64]. 
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3.1 Remark 

Note that po{t) is the probability that a linear birth and death process with initial value one goes 
extinct up to time t. Due to the branching property of a linear birth and death process, we have 
that po{t)^ is the probability that a linear birth and death process with a general initial value m goes 
extinct up to time t. By taking the limit t —)• oo, we obtain that the probability of eventual extinction 
is {p/X)'^ ioi \ > p and 1 otherwise (see e.g. (8.59) in [Bai64]). 

By elementary computations using these probability mass functions, we obtain the following propo¬ 
sition (cf. (8.16), (8.17), (8.48), (8.49), (8.54) and (8.55) in [Bai64]). 

3.2 Proposition 

(i) For X ^ p, we have 

E{Yt) = and Var(Yt) = 

A — fjj 

(ii) For A = /U > 0, we have 

E{Yt) = 1 and Var(yf) = 2Xt. 


Now we derive the expected reciprocal of the population size at time t given the population has not 
gone extinct until time t and state an upper bound: 


3.3 Proposition 

(i) For X ^ p, we have 


E( — 
Y 


17 > 0 = 


X — p 


- A 


log 


X — p 


< 


X — p 


(ii) For X = p > 0 we have. 

Proof: 

(i) We obtain for A / /U 


Eli 


E 


1 

Y 


17 > 0 = 


XeP-pt _ X 

log(l At) 


log 


X — p 


Xt 


17 > 0 = 


E 

n=l 


Pnjt) 


n — p 

n=l 

X — p 




VAe(-^-i)‘ 

1 - A 

XeP-p)t — X I ^g(A-/i)t _ „ 

n=l ^ 


E 


X — p 


< 


Ae(^-i)‘ - A 
X — p 

Ae(-^-i)* - A 
X — p 

Ae(^-i)* - A 


- A 

Ae(^-i)* - p 
Ae(^-M)i _ n 


- log ( 1 - 
log 


log 


X — p 
X 


X — p 


-|- (A — p)t 


-|- (A — p)t 
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(ii) We obtain for A = > 0 


E 




CXD 


E 

n=l 


Pn(t) 

n 


1 -Po(i) 


(1 + At) ^ 

n=\ 


1 

n (1 + At)^+1 


1^1/ At y _ log(l + At) 
At n I 1 + At j At 

n=l ^ ^ 


□ 

In the remaining of this subsection, we only consider the case where A > /x. In this case, we also need 
an upper bound for > 0), which is given by the following corollary. 

3.4 Corollary 

For X > jj, and t > log(2), we have 


E 


1 


At > 0 < e 2 




.vr, 

Proof: For A > /U and t > log(2), it follows from Proposition 3.3: 

1 


E 


%/E 


A* >0 1 < WE( 1 

If 


Yt>0 


< 


X — n 


\eP-Y)t - A 


log 


A 


X — n 


+ (A — /i)t 


< W^^i-^Aogf^ ) + (A - 


Ae(A-M)t 


X — ^ 




□ 


3.5 Remark 

Using Jensen’s inequality, we obtain an asymptotic lower bound for the conditional expectation con¬ 
sidered in the previous corollary: 

By the formula for po{t) given at the beginning of this section, it follows that 


lim E(yt|yt > 

t^OO 


lim E{Yt)e-^^-f^'>^ 

t^OO 


X 


X — fi 


X 


X — fi 


( 1 ) 


for A > /U, and since the function a; i—is strictly convex on the positive real axis, Jensen’s inequality 
yields 


E 



yt >0 


1 

y^E{Yt\Yt>0y 


( 2 ) 


Equation (1) implies 


lim — ezU 

y'EiYtlYt > 0 ) 




A — /X 
A 


Thus we obtain by (2) that E(-^^^|lt > 0) does not converge to zero at a faster rate than e zU 
as t —)■ oo for A > /X. 
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3.2 The number of births and deaths 


We consider a linear birth and death process {Yt)t>o with birth rate A, death rate fj, and initial value 
one as before. For convenience, we assume X ^ fi in this subsection. 

Let Bt be the number of births up to time t. Then {Bt)t>o is in general no Markov process since the 
number of births depends on the population size. However, the two-dimensional process {Bt,Yt)t>o 
is always a Markov process (see Example 6.4 in [CM77]). Let G{w,z;t) := be the joint 

probability generating function. Then a partial differential equation can be derived for G (see (39) 
on page 265 in [CM77] or (43) in [Ken49]), which can be used to compute G (see (40) on page 266 

in [CM77]). Moreover, the partial differential equation for G leads to the following partial differential 

equation for the joint cumulant generating function K{Qw,Qz',t) = log(G(e®”,e®^;t)) (cf. page 271 
in [CM77]): 

i^ = (Aee"+®--(A + rt+A<e-®-)^. (3) 

Since G is an analytic function, K is also analytic (as a composition of analytic functions). Thus for 
(0^, 0 , 2 ) from a ball around 0, we can write K in the form 

iL(0^,0,;t)= (4) 


where Hij denote the corresponding cumulants. By inserting (4) into (3), we obtain 



By equating coefficients, this yields the following differential equations for the cumulants: 


— Koi(t) = (A - 

^^/^02(t) = (A - H)KQ2{t) + ^^-^Koi(f) 

4-Kio(t) = AKoi(t) 
at 

— Kll(t) = (A - + XK02(t) + XKoi{t) 

^^'«20(t) = XKiiit) + ^AKoi(t). 


(5) 

( 6 ) 

(7) 

( 8 ) 

(9) 


Note that Kio(t) = E{Bt),Koi{t) = E{Yt),K 2 o{t) = Var(Hi), Ko 2 (i) = Var(yt) and Kn{t) = CoY{Bt,Yt) 
(cf. e.g. Subsection 5.1.2 of [CJIO]). Thus from (5) and (6) follows Proposition 3.2. The differential 
equation (7) yields 


E{Bt) 


^ c(A-0t _ ^ 

A — /i X — ij, 


if we assume Bq = 1. The differential equation (8) implies 


CoY{Bt,Yt) 


X{X + fi) 2(x-^,)t _ 2A// 

(A-^)2 X-f, (A-^)2 
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From (9) follows 


dt 


H2oit) 


2A^(A + /i) 2(\-u)t _ f (\-u)t 
( A -^)2 X -^, 


which implies 

^ A^(A + ^) 2(x-i,)t _ 4AV f (\-u)t , 

(A-^)2 

We summarize these results in the following proposition. 



{X-f,y ^ 


2X‘^H 

(A^ 


X{X + (x-^)t 

(A-^)V 


3.6 Proposition 

If we assume Bq = 1, we have 

E{Bt) = ^ (10) 

A — /i A — fi 

Cov{B,,Yt) = 

3.7 Remark 

The formula (10) can also be computed by using the differential equation 

jE{Bt) = AIE(yt), 

which can be derived by conditioning on Yt (see e.g. Section 2 of [CS12] or proof of Lemma 1 in 
Section 4 of [BL09]). Let Dt be the number of deaths up to time t. Then we have the analogue 
differential equation 

^IE(A) = MlE(ht), 

which implies 

E(A) = —^ 

X — ^ A — /i 

if we assume Dq = 0. 


3.8 Remark 

Note that ^{Yt) = lE(Rt) — K{Dt) and ~ ^ ~ ratio of the probabilities 

of a birth and a death at each event time. Furthermore, E(Ri) + K{Dt) = is the 

expected number of events up to time t. 


3.3 The age distribution 

Finally, we consider the age of an individual picked uniformly at random at a fixed time T > 0 in 
the linear birth and death process iYt)t>o (given Yt > 0). We briefly call the distribution of this 
age the age distribution of {Yt)t>o at time T (not to be confused with the “age distribution” treated 
for example in [Har02]). It is known that the age distribution converges to the Exp(A) distribution 
as T —)■ oo; see e.g. [HJV05]. In the pure birth case, the age distribution can be easily derived from 
known results, such as Theorem 1 in [NR71]. In the general case, the situation is far more complicated. 
However, we finally obtain the age distribution for any fi > 0. 




3 Distributions related to linear birth and death processes 


3.3.1 The age distribution in the pure birth case 

3.9 Proposition 

Let = 0. The ages of the individuals at time T that have been born later than the first individual are 
i.i.d. and their distribution is the truncated exponential distribution with parameter A. Consequently, 
the age A of an individual picked uniformly at random from all individuals at time T that were not 
alive at time 0 satisfies C{A) = C{Z\Z < T), where Z is Exp(A) distributed. 


Proof: Let Yt = n he given. From Theorem 1 in [NR71] follows that the n — 1 positive birth times 
in the time interval (0,T] are distributed as the order statistics of i.i.d. random variables T 2 , ■ ■ ■ ,Tn 
that have the cumulative distribution function 


F{fj < u) 


1 — exp(Att) 
1 — exp(AT) 


This implies ages T — T 2 ,..., T — T„_i are i.i.d. and have the cumulative distribution function 


P(r -fj<u) = l- F{fj <T-u) 


1 — exp(AT) — 1 + exp(A(T — u)) 
1 — exp(AT) 


1 — exp(—Art) 

1 — exp(—AT) ’ 


which is the cumulative distribution function of the truncated exponential distribution with parameter 
A. Since this distribution does not depend on n, this conditional distribution is equal to the uncondi¬ 
tional one. □ 


Before we turn to the general case, we give the following corollary. 

3.10 Corollary 

Let /i = 0 and let A be the age of an (arbitrary) individual at time T and 0 < b < T. Then the 
conditional distribution of A—b given A > bis the exponential distribution with parameter A truncated 
ed, T — b. 


Proof: Let Z be Exp(A) distributed. Then for the age A and z + b <T, Proposition 3.9 implies 


F{A-b< z\A >b)= F{Z <z + b\b<Z<T) 


F{b<Z<z + b) 
F{b<Z< T) 


—-^^=V(Z<z\Z<T-b) 


g-Afe _ g-A(2+fe) 
g-Ab _ g-AT 


and z i-T- F{Z < z\Z < T — b) is the cumulative distribution function of the exponential distribution 
with parameter A truncated at T — 6. □ 


3.11 Remark 

From the proof of Proposition 3.9, we know that we obtain the same distribution in Proposition 3.9 
and Corollary 3.10 if we condition on the population size Yt- Note that the age distribution of an 
individual picked uniformly at random conditioned on Yt follows immediately. The unconditional age 
distribution can be easily deduced from the conditional one by applying Proposition 3.3. The result is 
omitted here because the cumulative distribution function of the age distribution for arbitrary fi > 0 
is stated in Corollary 3.13 below. 
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3.3.2 The age distribution in the general case 

We consider the general case with birth rate A > 0 and death rate jj,. For convenience, we assume 
IJ, > 0 although all results also hold for ^ = 0. 

The following theorem gives us the age distribution at some time T conditioned on the population 
size. 


3.12 Theorem 

Let Fy^ denote the cumulative distribution function of the age of an individual picked uniformly at 
random at time T given Yt = vt for some yr > 0. Then Fy,j, is given by 


(^) 


Vt — ^ f e — e 1 /A(l — e —/u(l — e 


Ut 


1 - 


1 — e 


+ 


Vt 


X — H 


+ l{i=r} 


for t G [0, T] if A / y and by 

l/T — 1 


Fyrit) —- 


Ut 


1 - 






for t G [0, T] if A = y > 0. 


Before we prove this theorem, we state two corollaries. The first one reveals the unconditional age 
distribution of an individual picked uniformly at random. 


3.13 Corollary 

The cumulative distribution function F of the age of an individual picked uniformly at random at 
time T is given by 


Fit)= 1- 


X — fi 


+ 


XeF FT — X 
X — n 


log 


Ae(^ ^ 

A — y 


1 - 




1 — g 


XeF-FT _ A 


log 


_^g(A FT _ /\(^l _ g Aii) _ ^(1 _ g >'ty 


X — n 


X — n 


4t<T} 


+ ltt= 


{t=T} 


for t G [0, T] if A / y and by 


for t G [0, T] if A = y > 0. 


Proof: The expression for the cumulative distribution function follows directly from Theorem 3.12 
and Proposition 3.3. □ 


Note that the cumulative distribution function in Corollary 3.13 converges to the cumulative dis¬ 
tribution function of the Exp(A) distribution as T —)■ oo. The following corollary states that the age 
distribution converges exponentially fast to the Exp(A) distribution in a certain sense for A > y. 
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3.14 Corollary 

Let A > /i and let A denote the age of an individual picked uniformly at random at time T in the 
linear birth and death process {Yt)t>) (given It > 0). Then there exists a random variable Z with 
C{Z\Yt > 0) = Exp(A) such that 


E 




Yx >0 < 


A 


1 


+ 


X — fj, 


c + A — 1 Ae^"^ — A 


log 


X — H 


+ (A — fi)T 


for any c > 0. 


In order to prove Theorem 3.12 and Corollary 3.14, we use results from [BPS12] about contour 
processes (also known as exploration processes), which we introduce now. 


Bijection between piecewise-linear functions and trees 
3.15 Definition 

We consider piecewise-linear functions h : [0, b] —)■ [0, oo) for 0 < 6 < oo with the following properties: 

• The derivative is either 1 or -1 at all points where it exists.^ 

• h{x) > 0 if and only if x G (0, b) 

• All non-zero local minima are at distinct heights. 

We identify each such function h : [0, b] [0, oo) with the function h : [0, oo) —)■ [0, oo) that fulhls 

( hix) if 0 < X < 6 
0 if X > 6. 

Let % denote the set of all such functions. 

Now we introduce a bijection <1> between % and the set T of finite rooted binary trees. If not 
stated otherwise, we draw trees in a left-aligned way in this subsection, i.e. each inner node has a 
vertical line going in and out, and one horizontal line going to the right (see Figure 1). Horizontal 
lines are for constructive purposes only, whereas the vertical line segments represent actual edges and 
the corresponding edge lengths in the tree. Further below we will identify such trees with realizations 
of birth and death processes that also include information about the lifetimes of the individuals 
represented by the lengths of the vertical lines. 

Starting from a piecewise linear function h gT-L, we define <h(/i) by drawing a tree under the graph 
of h as follows (see also Figure 1): 

Let (xmax,i) hmax,i) be the leftmost local maximum of h. Then we draw a (vertical) line from (xmax,!, 0) 
to (Xniax,i,hmax,i)- Let (xmax, 2 , hmax, 2 ) be the second leftmost local maximum and (xmm,i, hmm,i) 
be the leftmost non-zero local minimum if they exist. In this case we add a (vertical) line from 
(a:max, 2 ,hmm,i) to (xmax, 2 , hmax, 2 ) and a (horizontal) line from (xmax, 2 , hmm,i) connecting the vertical 

^ Note that the formulation in [BPS12] is slightly more general. However, we do not need this more general setting for our 
purposes. 
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line horizontally to the rest of the tree. We continue this procedure until we have explored all non-zero 
local extrema (see Figure 1). 

It is clear that <I> is a bijection since we obtain the inverse of by assigning to any (left-aligned) tree, 
where the horizontal position of the root and the horizontal line lengths have been suitably adapted, 
the function that starts at (0, 0) and “turns” at every leaf and every time it hits a horizontal line until 
it hits the x-axis again. 

Probability laws of the contour process and the corresponding random tree 

Now we define probability measures on % and T. Let for the moment 0 < A < ;U. We define the 
contour process as a stochastic process {Hx)x>o whose trajectories belong to Let {Uk)keN and 
(hfc)fceN be two mutually independent sequences of independent, identically exponentially distributed 
random variables with parameters /x and A, respectively. Let for all k and let be 

the law of the random element of T-L with the following properties. Starting with a first local minimum 
at (0,0), the heights of the first local maximum and the second local minimum are Ui and (Zi)"*", 
respectively. The process is stopped whenever it returns to zero. If {Zi)~^ > 0, the heights of the 
second local maximum and the third local minimum are U 2 + Zi and (Zi + ^ 2 )"*"; respectively. If 
{Zi -|- ^ 2 )^ > 0, we continue in the same way (see Figure 2). 



It can be shown that this process returns to zero almost surely if 0 < A < /x (see Section 2 of 
[BPS12]). 
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3 Distributions related to linear birth and death processes 


In Theorem 3.16 below, we relate to the law of a linear birth death process with the same 
parameters. Since we are also interested in the supercritical case, we also define the law of a random 
element of % that can be related to the law of a supercritical linear birth and death process with birth 
rate A and death rate fj, < X now. Since the contour process defined above does not return to zero 
almost surely if A > ^, we need to modify the construction. Therefore, we take the process from above 
and add a deflection at T: The j-th time the process reaches T, it stops and goes down with slope 
— 1 for a time min(T, Zj) where Zj ~ Exp(A) independent (also from all other random variables). It 
can be shown that this deflected process returns to zero almost surely (see Section 2 of [BPS 12]). Let 
denote the law of the deflected process. Note that Pa,^,oo = 

Let (5A,/i,T be the law of the random tree corresponding to a linear birth and death process with 
parameters A and ^ killed at time T, i.e. all individuals alive at time T— are killed at time T. From 
a realization of such a birth and death process, including information about the lifetimes of the 
individuals, draw the corresponding random tree as follows. Draw the lifetime of the individual that is 
alive at time 0 as the leftmost vertical line. For each of its direct offspring attach a horizontal line and 
draw their lifetimes as further vertical lines. Starting with the latest offspring and keeping the lower 
horizontal lines long enough will keep the tree planar. Continue in the same way for the offspring of 
each offspring and so on. Note that the lifetimes of the individuals correspond to the lengths of the 
vertical lines. Obviously, the ages of the individuals at time T in a linear birth and death process are 
just obtained by the total lifetimes of the individuals alive at time T— in the corresponding killed tree 
(see Figure 3). 

We allow T = oo, in which case the birth and death process is not killed. 

The following theorem reveals the simple relationship between and 

3.16 Theorem 

For any A, /U > 0 and T G (0, oo), and also for T = oo if A < ^, we have 

= Px,fjL,T^~^- 

Proof: [BPSI2], Theorem 3.1. See the proof of that theorem to verify the precise construction of the 
left-aligned tree based on the birth and death process. □ 

Proof of Theorem 3.12 and Corollary 3.14 

We first prove that we obtain the same distribution if ages are read off from the tree under the 
vertically mirrored contour process {HT—t)o<t<T, where r = inf{x > 0; = 0} (see Figure 4). To 

see this, denote by T: T —)• T the corresponding transformation on the set of trees. Let D ~ Qx,^i,t 
be the random tree obtained from the above birth and death process and denote by Q{d) the finite 
set of trees that have the same vertex positions in the vertical direction as a tree d G T, but not 
necessarily the same phylogeny. Note that ©(4'((i)) = 0(d) for every d G T. Given 0(D), the random 
tree D is uniformly distributed on 0(D), and so is 'k(D) because, restricted to the finite set 0(D), 
the map T is just a permutation. Hence the distributions of 'I'(D) and D are equal by integration of 
the conditional distributions given 0(D). 
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3 Distributions related to linear birth and death processes 



Figure 3: The deflected contour process and the corresponding tree representing a linear birth and 
death process up to time T 



Figure 4: The mirrored deflected contour process (the realization corresponds to Figure 3) and its 
(left-aligned) tree 


Since it is more customary to draw stochastic processes from left to right, we mirror the situation 
in Figure 4, which gives us the original contour process {Hx) again, enveloping a right-aligned tree 
(constructed in the analogous way as the left-aligned tree, but from right to left; see Figure 5). Of 
course the ages still correspond to vertical line lengths. 

Obviously, the local maxima of the contour process at height T correspond to the living individ¬ 
uals in the linear birth and death process at time T, which we assume to be numbered from left 
to right in the right-aligned tree here. In total, there are Bt local maxima (at any height) corre¬ 
sponding to the Bt individuals that have been born up to time T. Let these maxima be numbered 
from left to right and let Ii < ... < /yj, be the indices of the maxima at height T and Ik = oo 
for k > Yt- Then ..., ^Iyj, ^ ^ tbe ages we are interested in, where (I4)fcgN is the 

sequence of random variables from the definition of the contour process, i.e. {Vk)k£N ^re i.i.d. Exp(A) 
distributed. Let 0 = Wi < ... < W2 Bt+i = t be the positions of all local extrema of {Hx)o<x<t- 
Note that W 211 ,..., IF 27 y^ are the positions of the maxima at height T. For convenience, we define 
Wk = W 2 BT +1 = k and IFoo = hm Wk = r. 

k^oo 

By the above construction of the contour process, the process {Hwt,, (—l)^)fceN is a Markov chain 
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on [0,T] X {—1,1} with initial value (0, —1) at time 1 and transition probabilities 
(-1)^+1) G Si X S2|(Siy„(-l)") = {zuZ2)) 

+ fi f e~^('^~^^^du)Si(B2) if Z2 = —1 

^ ^ ^ Bin[zi,T) ^ 

(<5o(Si)e-^^i + A f e-^(^i-“)du)<5_i(S2) if ^2 = 1, 

, ^ Sin{o,2:i] ^ 

where Si is a Borel subset of [0,r], S2 C {0,1} and (2:1,22) G [0,T] x {—1, 1 }. Note that the second 
component of (Swj,, (—l)^)fceN tells us if {Hx)o<x<t has a minimum (if it is —1) or a maximum (if it 
is 1) at Wk- Note further that this Markov chain determines {Hx)x>o completely. 

We define the sequence (Cfc)fceN of hitting times in (T, 1) recursively by 

6 = inf{fc G N : (Siy,, (-1)^) = (T, 1)} and 6+1 = mf{fe > ii : {Hw,, (-1)"^) = (T, 1)} 

for all I G N, where inf 0 = 00 . Note that each is a stopping time and that max{/ : < 00 } = Yt- 

Moreover, we dehne the “horizontally mirrored” excursions 

^(0 = = (T - Hwk)ii<k<ii+i 

for I G N, where we set 00 — 00 = 0. 

Since P((Sii/^^, (—1)^*) = (T, l)|^i < 00 ) = 1, the strong Markov property and the fact that the 
second component of the process is deterministic imply that {Hwk)o<k<ii and {Hwu)ii<k are in¬ 
dependent given < 00 and that {Hwt,)^i<k has the same distribution given < 00 for any 
/ G N. As a consequence, we have for any I G N that {Hwf,)o<k<^n ■ ■ ■ ■, are in¬ 
dependent of {^r)r>i+i) given < 00 and hence that {Hw^)o<k<^i, E^'^\ ..., are 

independent of E^^'^ given < 00 and given any sub-u-algebra of a{^r]r > ^ 1). This implies 

that {Hw,)o<k <^,, EW , ^(2),..., (/-i), Em are independent given ^; < 00 and given any sub-cr- 

algebra of ^ + 1) for any I G N. Note that for any I G N, the first I — 1 “horizontally 

mirrored” excursions E^E^E^^^... all have the same distribution under this conditioning be¬ 
cause E^^\ E^) ^^ all have the same distribution given < 00 and E^^'> ,E^‘^\ ..., are 

independent of (Cr)r>«-i-i given < 00 . 
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Since {Yr = i/t} = {^yj, < oo,^yj,_l_i = 00 } we obtain that, given Yt = yx, the processes 
..., independent and identically 

distributed. By having a closer look at the strong Markov property used above, it is seen that neither 
the distribution of E^^'^ given Yt = yx ^ nor the distribution of given hr = 2/r > 1 depend 

on the concrete value yr- Note that on {Yx = yx}, we have e[^'^ = Vi^,... ^E^"^ and 

E^t) _ /\ E, so these are the ages we are interested in. The conditional distributions of E^'^ 

and given Yx = yx can be easily rediscovered as conditional distributions in another contour 

process, which is what we look at next. 

Let {Hx)x>o ~ Ek',\,T- Theorem 3.16, {Hx)x>o is the contour process corresponding to a linear 
birth and death process with hirth rate y and death rate A that is killed at T. Let the positions {Wk)k 
of the extrema of {Hx)x>o be defined analogously to {Wk)k- 

Both and for arbitrary I G {1,... ,Yx} start in 0 and alternate between 

independently adding Exp(A) and subtracting Exp(/i) random variables at least until 0 or T is crossed. 
Thus until this happens, they have the same distribution. We have {Yx > 2} = = 0} = 

{maxfcE^^^ < T}, and therefore for any yr > that, since the distribution of does not depend 
on the concrete value of yr, 


/:(eW I Yx 


Vt) = E[ H, 


V 


W 2 


max Hxxr 
k 



C 



maxEj, < T 

a;>0 


( 11 ) 


On the other hand we have {Yx > 1} = {Yx > l,maxfcE^^^^ = T} (in fact ^Yt+i = 00 and is 

eventually absorbed in T). Therefore for any yx > 1, since the distribution of does not depend 

on the concrete value of yx, 


C{E^f^'> I Yx 


Ut) = e( H, 


V 


W 2 


max H.ir 
k 



C 



max Hx = T 

x>0 


( 12 ) 


In other words, we have identified the desired age distributions at time T given Yx = yx SiS the 
distribution of the lifetime of the hrst individual in a linear birth and death process with birth rate 
fi and death rate A conditioned on extinction of the process by time T (Equation (11), hrst yr — I 
individuals) or conditioned on survival of the process up to time T (Equation (12), last individual, 
lifetime measured up to time T). 


We hrst compute the age distribution of the hrst yx — ^ individuals given Tr = y-r > 2. By Bayes’ 
Theorem C{H^^ \ max 2 ;>o Hx < T) has a density 


fS) oc Ae-^¥ 


( maxEa; < T 

V 2 ;>o 



(13) 


where t < T. By Theorem 3.16, we have to compute the probability that the linear birth and death 
process with birth rate y, and death rate A goes extinct before time T given the lifetime of the hrst 
individual is t. Note that the birth times of the offspring of the hrst individual form a Poisson process 
with rate y (up to time t). By conditioning on their number (up to time t) and their birth times, and 
by the formula for the extinction probability from Subsection 3.1, we obtain that this probability is 
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equal to 


/ ^ jii ^ ^ J ''j ' ~ * '' ''' ^ ~ 


k=0 


kl 


1 _ Ae(A-At)(r-q) ''' ^ _ Ae(A-/i)(T-4) 

0 0^ M 




A:=0 

oo 


g-(A-/i)(T-«) _ 
-(A-M)(r-«) _ A^^ 


1 / \ 

= ^ -e-^D log(Ae(^-^)('^-') - /i) + At - log(Ae(^-^)^ - j 

for t < T if A / /i. For A = /x > 0, we see analogously that this probability is equal to 


(14) 


(At)"„-Ati / r A(r-ti 


f [ _ 

h 'J 1 

fc-O Q Q 

fc=0 


_ A(T — tk 

+ A(T — ti) 1 + A(r — tk) 


dti... dtk 


k=0 

OO 


i + A(r-u) 


^ ^e-^*(log(l + A(r - t)) - log(l + XT) + AT)' 

A:=0 

i + A(r-t) 


(15) 


1 + AT 

Now we can easily compute the normalizing constant of the density /* in (13); For X ^ we obtain 
T T 

J Ae-^*(Ae(^-^AT-i) _ = J - ;ue-^*)dt 

0 0 

= e(^“^^^(l - e“^^) - (1 - e“^'^) 

= - 1 

and for A = /X > 0, we have 

T T 

„ — I \ /rp J.\\ J4- Xl „ — I \Tn\ f \2„ —AXj 


Ae-''*(1 + A(T - t))dt = (1 - )(1 + AT) - / X^e-^Hdt 


0 


0 


= (1 - e"^^)(l + AT) - (1 - e"^^(AT + 1)) 
= AT. 


Thus the density /* is given by 

X^{\-k)T^-\t _ 


f*{t) = 


g(A k)'^ — 1 


Xe-^^ - 

1 — g~(A— 
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for t G [0, T] if A / and by 


f*{t) = 


Xe-^\1 + X{T - t)) _ e-^\l + A(T - t)) 


—Xt / 


XT T 

for t G [0, T] if A = ^ > 0. By integration, we obtain that the cumulative distribution function F* for 
the age of any of the first ut — ^ individuals given Yt = vt takes the form 


F. (t) = 1 - 


g-At _ 
1 — 


(16) 


and 


F(f) = 1 - 


e-\t{T_t) 

f 


for t G [0,r], respectively. By multiplying the probability that, given Yt = yx, the randomly picked 
individual is one of the hrst yx — ^ individuals, we obtain the hrst summands in Theorem 3.12. 


3.17 Remark 

In particular, we see that 

l-F(t) 


g-At _ g-(A-/i)rg-/^t e“'*'*(l — 

1 — — g—(A—/i)r 


= e 




for t G [0, T] if A / /X and 


1 - F (t) 


< e-" 

T 


for t G [0,T] if A = ^. Thus, given Yx = yx, the age distribution of the first yx 
stochastically dominated by the Exp(A) distribution. 


1 individuals is 


Before we determine the distribution of the last individual in a similar way, we prove Corollary 3.14. 
We can explicitly define the random variable Z in the statement of this corollary. Given Yx = yx, let 
Jx be uniformly distributed on {l,...,yx}, independent of all other random variables, and let A be 
the age of individual Jx, where the individuals alive at time T are still numbered from left to right in 
the right-aligned tree. 

Let Z be an arbitrary Exp(A) distributed random variable that is independent of everything else 
and let Fo denote the cumulative distribution function of the Exp(A) distribution. Set 

Z . 11{J7.(aj)<yy(a;)}-^oo (■^*(^(^)) 4” 


Then Z is Exp(A) distributed given Yx > 0 because conditionally on Tr = l/T > 0 it is Exp(A) 
distributed both given Jx = yx and given Jx < yx- The latter holds because F is a continuous 
function. We have for c > 0 


E 


( 

1^ 

1^ 


V 

ct 

1 

a 



Yx > 0 


'Yx - 1 
= E( — -E 


Yx 


^-cA _ g-cZ 


< E^E 


_ g-cZ 


Yx, Jx <Yx ] + —E 

Yx 


Yx > 0 


_ g-cZ 


Yx, Jx = Yx 


Yx > 0 


Yx, Jx <Yx ] + — 

Yx 


(17) 
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By Remark 3.17, the Exp(A) distribution stochastically dominates the distribution corresponding to 
F*. Thus we have Foo(^) < F,(A). Moreover, given Jt < Yt, we have Z = Since F^^ 

is increasing, this implies that, given Jt < Yt, we have A < Z almost surely. Thus the first summand 
in the outer expectation on the right-hand side of (17) is equal to 


El 


Yt, Jt < YrJ = E(e-^^|yr, Jt < Yt) - E(e-‘^^|yT, Jt < Yt). 
Since, given Yt = vt > ^ and Jt < Vt, the random variable Z is Exp(A) distributed, we have 

-cZ~ - - - ^ 


(18) 


E(e-'^^|Jr < Yt) = 


A “1“ c 


For the first expectation on the right-hand side of (18), we compute 


W.{e-^^\YT,JT < Yt) = 


1 


1 _ e-(A-FT 
1 


(Ae-(A+c)t _ 


< 


1 — g ('^ 

A 

A -|- c — 1 ’ 


(^(1 _ e-(^+'=)^) - ^e-(^-^)^(l - 


where we assume X > /j,. Thus for X > /J,, the right-hand side of (18) is smaller than or equal to 

A / 

A-Fc - 1 

For the second summand in the outer expectation on the right-hand side of (17), we use the upper 
bound from Proposition 3.3. 


< 


A 


1 


c -h A - 1 ■ 


(19) 


In order to prove Theorem 3.12, it remains to compute the age distribution of the last individual 
given Yt = vt- We do this analogously to the procedure for the first 2/r — 1 individuals. 

Recall from (12) that we have to compute £(i7^^ | maXj,>oi7a; = T). Note that has a density 
/ with respect to the measure Le6[o,r) + ^T given by 


ht) 


Ae if 0 < t < T 
e"^'^ if t = T. 


Thus by Bayes’ Theorem, the age distribution of the last individual given Yt = ht has a density f* 
with respect to Leb^Q T) + ^T satisfying 


/ 

\ 


/ 

Ae--^¥( 

max Hr = T 

( max Hx = T 

V ^>0 

^W2 ^ 

= < 

\ 

^-XT 

V 2;>0 




e 



^W2 ~ ^ 


if 0 < t < T 
if t = T. 


Firstly, we consider the case where X ^ fi. By (14), we have that 


P 


( max Hx = T 

V ^>0 



1 


’( max 77a; < T 

V *>o 



= 1 - 


Ae(A F{T t) 
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for t G [0, T), which implies 


f*{t) oc 


Ae(^-M)T_^ II U ^ t < J 


if t = T. 


We can compute the normalizing constant of f* as 


e-"^ + 


Xfi 

_\g(A-/i)r _ ^ 




(e-^* - e-^^)dt 


^ -XT , A(l-e ^'^)-^(l-e 

^ ' X r \ ..\rn 


Xei^ ^ 


3.18 Remark 

By Bayes’ Theorem, the normalizing constant of the density f* is just the probability that a linear 
birth and death process with birth rate ^ and death rate A survives up to time T. Thus we could 
also have used the extinction probability from Subsection 3.1 in order to obtain the right-hand side of 
( 20 ). 

We may conclude that f* is given by 


rit) = 


if 0 < f < T 
- if t = T. 

A —II 


By integration, we obtain that the cumulative distribution function F* for the age of the last individual 
given Yt = ut takes the form 

^ (t) — -7-ll{i<r} + ll{i=r} (^1) 

A — /i 

for t G [0,T]. 

For X = fj, > 0, we proceed analogously: By (15), we have that 


PI max Hx = T = t ) = 1 — P ( max Hx < T R,?. = t] = 1 — 


1 + A(T — t) Xt 
l + AT “ l + AT 


for t G [0,T), which leads to 


f*{t) oc 


Ae"^*Y^ ifO<t<T 


if t = T. 


We compute the normalizing constant of f* as 


e-AT + / g-At 


dt = +_ - _ e-^'^ = _i_ 

l + AT ^1 + AT l + AT' 
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This yields that f* is given by 


nt) 


if 0 < t < T 

(1 + XT)e-^^ ift = T. 


By integration, we obtain that the cumulative distribution function F* for the age of the last individual 
given Yt = ut takes the form 


F*{t) = (1 - e-^*(l + + l{t=T} 


for t G [0,T]. 

By multiplying the probability that given Yt = ut the randomly picked individual is the last one, 
we obtain the second summands in Theorem 3.12. □ 

3.4 Upper bound for the time since the last event 

Let 0 = Ti < T 2 < ... be the event times of our linear birth and death process. Since Bt + Dt is 
the number of events up to time T, the random variable T — Tbj,+Dt describes the time passed since 
the last event as seen from time T. Note that, given the population size Yt = ut, the distribution 
of the difference Tbj,+Dt+i ~ Tbj,+Dt between the previous and the next event time at time T is 
typically larger than the Exp(y'r(A +/u)) distribution due to the inspection paradox. Therefore, we do 
not bound T — Tbj.+Dt from above by Tbj.+Dt+i “ Tbj.+Dt- Instead we use that, given Tr = i/T > 2, 
the time since the last event is bounded from above by the minimum of the ages of the first yT — ^ 
individuals, where the individuals are ordered as in the previous subsection. We know that these 2 /t ~ 1 
ages are are i.i.d. and that their distribution is stochastically dominated by the Exp(A) distribution 
by Remark 3.17. Since the minimum of yr ~ 1 independent Exp(A) distributed random variables, is 
Exp((y'r — 1)A) distributed, we obtain the following theorem. 

3.19 Theorem 

Given Yt = vt, the distribution of T — Tb^+Dt is stochastically dominated by the distribution with 
cumulative distribution function 


G{t) = l{,,>i}(l - 


4 Degree distribution at finite time 

In this section, we deduce the degree distribution at finite time T. 

Eirstly, we consider the case where the nodes do not die, i.e. y = 0. Let the nodes be ordered by 
their birth times. Let Si be the social index of node i and Ai{T) its age at time T. Eor convenience, 
we dehne A 2 {T) = 0 if = 1- Eurthermore, given Yt = j/T; let the random variable Jt be uniformly 
distributed on {1,... ,yT} and independent of the ages, the social indices and the rest of the path 
(lt)o<t<r- We interpret Jt as the index of a node that is randomly chosen at time T among all living 
nodes. 
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4.1 The number of outgoing edges in the pure birth case 


Firstly, we condition on the population size ht-, the randomly picked node jx, its social index Sj^ and 
its age Ojj, at time T. Furthermore, we condition on the age 02 of the second node if yr > 1 and set 
02 = 0 otherwise. If yr > the number of living edges that jx creates during its lifetime can be 
described by a birth and death process with constant birth and linear death rate started in zero at 
time T — ainax(iT,2) since no edges are created as long as there is only one node. The birth rate is asjj, 
and the death rate is the product of 13 and the number of (living) edges that jx has created. Such a 
birth and death process is examined in Chapter XVII of [Fel67] and can be interpreted as a queueing 
system with constant arrival rate and inhnitely many servers. We denote the state probabilities at 
time t of such a process started at a general state i with constant birth rate A and linear death rate 
n/2 by Pnit), where n is the current state of the process. This process is described by the following 
Kolmogorov forward differential equations: 

- (A + njl)Pn{t) + fl{n + l)Pn+i{t) for n > 1 (22) 

^ = f,P,it)-XPo{t). (23) 

CXD 

Thus we have for the generating function P{s,t) = Pn{i)s^ for all s with |s| < 1: 

n=0 


. 0 . ^ E Pn{t)s'^ 00 .. 

dP{s,t) _ ^=0 _ dPn[t) - 

Wt “ Ft “ ^ dt ^ 

n=0 
00 

= E» 

n=l 

= {l-s) {-\P{s, t) + , (24) 

where the second line follows from the Kolmogorov forward differential equations (22) and (23). The 
solution of the partial differential equation (24) is given by (cf. [Fel67], page 481) 


^XPn-i{t) - (A + nj2)Pnit) + j2{n + l)Pn+i{t)^ + i3Pi{t) - XPo{t) 


P{s,t) = exp( -A(l - s) ) (1 - (1 - s)e-Py. 




(25) 


The function P{- ,t) is the product of the generating function of the Po(^(l—e“^*)) and the Bin(z, e~P) 
distribution and hence the generating function of their convolution. In particular, we obtain for i = 0 
the generating function of the Poisson distribution with parameter 

A(1 - e-P) 

A 

This result can be applied to the edge process we are interested in: Conditioned on the population 
size yxi the chosen node jx, its social index Sj^ and age the distribution of the number of edges 
created by jx is the following Poisson distribution: 


Po 


i 


as 


3T 


V P 


(1-e 


-/3an 
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where the Po(0) distribution is the Dirac measure in zero. Note that we obtain the Dirac measure if 
only one individual is alive at time T, i.e. yx = ^, since we set 02 = 0 in this case. 

Consequently, the corresponding unconditional distribution is the following mixed Poisson distribu¬ 
tion: 

MixPo ^—~ 

4.2 The number of incoming edges in the pure birth case 

Recall that the nodes are ordered by their birth times. Firstly, we condition on iYt)o<t<T = iyt)o<t<T, 
the social indices {Sk)keN = {sk)keN and Jt = jx- Let T = ai > ... > ayj. denote the corresponding 
ages of the individuals. 

Consider a fixed node i G {1,, yx} \ {jr} and some time interval of the form [T — ai,T — a;+i) 
for I > iy 2. We only consider edges that are created by the fixed node i. Then the number of edges 
that connect i to jx and that survive until the end of this interval, i.e. until the birth time T — a/+i of 
node I + 1, can be described by a birth and death process again. The population size is constant and 
equal to I in the interval [T — ai,T — a;+i). Thus the birth rate is also constant and equal to asij^ 
because node i creates edges at rate asi and is the probability that an edge that is created in the 
interval [T — ai,T — a;+i) is connected to jx. The death rate is linear again with factor /3. As in the 
previous subsection, we obtain a Poisson distribution, i.e. the number of edges that are created by i 
in [T — ai,T — a;+i), connect i to jx and survive until T — a;+i follows the distribution 

Now the distribution of the number of edges that i creates in [T — a^, T — 0 ^+ 2 ), connect i with jx and 
survive until T — 0^+2 can be deduced from (25). In order to do this, let Z be the number of edges 
that i creates in [T — ai,T — a^+i), connect i with jx and survive until T — a^+i. We have already 
shown that Z is Po(^l^^(l — distributed. First, we condition on Z = z. Then the 

number of edges that i creates in \T — ai,T — 0 ^+ 2 ), connect i with jx and survive until T — 0^+2 
is described by a birth and death process with constant birth rate linear death rate with factor 
/3 and initial state z. For such a process, the probability generating function (25) was stated in the 
previous subsection. Since (25) is the product of the probability generating function of a Poisson 
distribution and of the probability generating function of a binomial distribution, we obtain the sum 
of a Po(^(l — distributed and a Bin( 2 :, distributed random variable for 

time T —a;_|_ 2 , where these two random variables are independent. The second one is a Poisson mixture 
of binomial distributions if we do not condition on Z = z. This mixture is the Poisson distribution 
with parameter Consequently, it follows that 

V (^ ~ w 

is the desired distribution. 
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4 Degree distribution at finite time 


We obtain inductively by mixing and independence that 

/ Vt-^ \ \ 

Po| ^ g-^(ai+fc-ai+fc+l) I __("x _ g-/3(ai-ai+i)^ I 

\=iVjT ^ fc=l ^ ^ 

is the distribution of the number of edges that connect i to Jt surviving until T - 
Repeating the same argument as above, it follows that 

1 . . 7 1 


a, 


lyr- 


Pol 


2/t—1 \ \ 

n ^-I3avj,\ I „-l3avj, ( TT -/Sjai+k-ai+k+l) \ ('1 

is the distribution of the number of edges between i and Jt being alive at time T. 

By independence of the edge processes of edges generated by the various nodes i, we obtain the 
following distribution for the total number of incoming edges in node Jt- 

' yr yr J/T-l /Vt-l-l N N 

(1 _ g-/^°i/T) -g g-/3(ai+fc-ai+fc+i) I _ g-/3(ai-ai+i)^ I 

- i=l ~ i=l l=ivjT ^ fc=l / (^ “ / 

i¥=jT 

\ 


Po 


/ yr 
' a V-- 


--iVjT 

yr ?/T —1 

.(l_e-/3a^T) +" ^^(e-/3«i+i _g-/3«i) 

^ i=l ^ ^ 

conditioning, we obtain a mixed Poisson distribution: 

/ Q 

r I ^ \^ 

E yyv 


.Po(: 

hus without 


V /5 ^ yr - 1 

i¥=jT 


/I 


4.3 Degree distribution in the pure birth case 

By the independence of the numbers of incoming and outgoing edges, the previous considerations 
imply that the degree distribution of a randomly chosen node is a mixed Poisson distribution. The 
random parameter of this mixed Poisson distribution is the sum of the random parameters of the two 
mixed Poisson distributions that were deduced in the previous two subsections. Thus we obtain the 
MixPo(A'r) distribution, where 


Aj' 


C^Sjrp , 




k(^J'P ,2 


"D+^E 


Yt 


Si 


2 = 1 
ij^Jr 


Yt - 1 


(1 -e 


-pAy^j 


(D) 


+ i E E‘ 

2—1 l=iy J'T 
ii^Jr 


_ e-pAi(T)y 


( 26 ) 


4.4 Degree distribution in the general case 

If the nodes can die, the degree distribution can be determined similarly to the pure birth case. 
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4 Degree distribution at finite time 


Let 0 = < ... < be the birth times. Recall that 0 = Ti < ... < Tbj,+Dt ^re the event 

times up to T. Moreover, let T~ denote the death time of node i for each i, where the nodes are still 
numbered according to their birth times. Finally, given the subset of {1,... , Bt} that contains the 
indices of all living nodes at time T, let Jt be uniformly distributed on this set and independent of 
all other events as before. Note that Sj^, is (stochastically) independent of {{Yt)o<t<T-, Jt)- 
The crucial differences compared to the procedure for the pure birth case are: 

• We have to condition on Tr > 0 since there has to exist a node at time T that can be picked 
randomly. 

• We have to consider [T;,T/+i) instead of [T — Ai(T),T — Ai^i(T)) as time intervals in which the 
parameters of the process of incoming edges are constant. 

• We do not sum over all nodes i, but over all that are still alive at time T. 

• In general, the population size does not increase almost surely by one per time interval. Thus 
the population size is not necessarily equal to I in the time interval [Ti, Ti+i). 

Since deaths of other nodes reduce the number of outgoing edges, we cannot derive the distribution 
of the number of outgoing edges at time T in the same way as for the pure birth process. However, 
we can proceed in a similar way as for the incoming nodes in the pure birth case. Therefore we do 
not derive the distribution of the numbers of outgoing and incoming edges separately here. 


We condition on (Tt)o<t<r = {yt)o<t<T, the social indices {Sk)k&n = (sfc)fceN and Jt = Jt- Let br 
and Jt denote the corresponding number of births and deaths up to time T, respectively. Furthermore, 
let 0 = ti < t 2 < • • • < tbj,+dT and 0 = < ... < he the corresponding event times and birth 

times up to time T, respectively. We consider a fixed node i / Jt that is alive at time T (provided 
there are any) and some time interval of the form for ti > tf V and < T. Note that 

the nodes i and Jt create edges with rate asi and asjj ,, respectively, and that the probability that an 
edge that is created by i is connected to Jt is and equal to the probability that an edge that 

is created by Jt is connected to i. Thus the number of edges that connect i to Jt and survive until 
time can be described by a birth and death process with constant birth rate a{si + and 

linear death rate with factor /3 like before. Analogously to Subsection 4.2, we obtain that the number 
of edges that are created in [ti,ti+i), connect i to jx and survive until t;+i has the following Poisson 
distribution: 


Po 


a{si + s 

iVti - l)/3 




Let the function r = : {!> • • •; ^r} —^ {1, • • •, &T + dx} be defined such that tJ = for all 

j G {1,... , bx}, i.e. r maps birth number to event number. Note that the sum bx + dx of the number 
of births and the number of deaths up to time T is the number of events up to time T. 
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5 Bounds on the total variation distance between the finite time and asymptotic degree distribution 


Applying the same iterative procedure as in Subsection 4.2, we can see that 
Po 


bT+dT—l ybT+dT—l—1 ( \ \ \ \ 

g-/3(r-4y+dr) X] f n Q-l^i^i+k+i-k+k)?!:^ -_ g-/3(h+i-ii)) j 

l=r(i)Vr(iT) ^ k=l ^ ^ 


l=r{i)Vr{jT) 

is the distribution of the number of edges between i and Jt being alive at time T. 

Let the death times be denoted by ..., Then by independence of the processes of edges 

generated by the various nodes i, we obtain the following conditional degree distribution: 


bj' 

ME 


a{si + Sjj.) 


6'j’ 


^ iVT - l)/3 

Mjt 

b']p-\-dj' — 1 y ybqp-\-dqp—I — 1 




2=1 


-l-aT —J- / yUT-tu^T—i — i- \ f \ \ \ 

i)Vr{jT)^^ fc=l ^ ^ 

/ I 

Pn( - + gjT I /I _ -/ 3 (T -4 )n| 


Mjt 

+ “Vl V + Mt . -/3(T-t,+i) /3(r-fi)^| 


Mj't 


l=r{i)Vr{jT) 


Thus the degree distribution is MixPo(A^), where T(A^) = £(AT|Lr > 0) and 


i^Jt 


Bt 


Bt-\-Dj '—1 


+ E '^{T->T} (y _ 1) 

i=l l=r{i)VriJT) ^ ^ 

Zy^JT 


V ■S'j + ^Jt ( -fj(T-Ti+i) _ .-p{T-Ti 

(Vrr, 


))1 




(27) 


5 Bounds on the total variation distance between the finite time and 
asymptotic degree distribution 

Since in the pure birth case we obtain a much better bound with considerably less work, we treat the 
cases /i = 0 and fi > 0 separately. 

5.1 The pure birth case 

Firstly, we consider the pure birth case, i.e. ^ = 0, again. Fix T > 0 and let the random variable 
At be defined by (26). Furthermore, let Sj^ and Aj^ be independent random variables such that 
= Sj^{uj) and Aj^{uj) := F^^{Ft{Aj^{T,uj))) for all uj £ where Ft and F^o are the 
cumulative distribution functions of Aj^{T) and the Exp(A) distribution, respectively. Note that 
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we obtain from Proposition 3.9 that C{Aj^) —)■ Exp(A) weakly as T —)■ oo. The random variable 
Aj^ is Exp(A) distributed since Ft is continuous and hence Ft{Aj^{T)) is uniformly distributed on 
[0,1]. Moreover, since the exponential distribution dominates the truncated exponential distribution, 
we have Fao{Ajj,{T,io)) < Ft{Aj^{T,oj)) for all a; G fl. Since F^ is increasing, it follows that 
Aj^{T,uj) < Aj^{oj) for all a; G fl. 

Let 5 be a generic random variable that is distributed according to the distribution of the social 
indices and let 

+ (28) 

We know from Subsection 4.3 that MixPo(A'r) is the degree distribution at time T in the pure birth 
case. Theorem 5.2 below implies that MixPo(A'r) converges at rate of just a bit slower than 
to MixPo(M) as T ^ oo, which is the asymptotic degree distribution already stated in Section 3.2 of 
[BLIO]. Note that these theorems are much more powerful as they give an exact distance bound for 
every finite T. 

5.1 Theorem 

We have 

dTy(MixPo(AT),MixPo(M)) < 
where ag is the standard deviation of S. 

The main idea of the proof is to make use of Theorem 2.1 to obtain EdA-p — M|) as an upper 
bound for the total variation distance on the left-hand side. In order to bound EdA-r — M|) further, 
we use that the expected value of the second summand of the right-hand side of (26) becomes small 
as T —)■ oo since the age Ayj,{T) of the youngest individual at time T converges quickly to 0, and 
compare the other summands of the right-hand sides of (26) and (28). 

Eor the comparison of the last summands in (26) and (28), respectively, we note that the average of 
the social indices becomes close to E(5) by the Law of Large Numbers. Then we use again that the 
age Ayj,{T) of the youngest individual at time T converges quickly to 0 and that Aj^{T) converges 
quickly to Aj^ Proposition 3.9. 

Einally, the expected absolute value of the difference between the first summand of (26) and the 
first summand of (28) again becomes small since Aj^(T) converges quickly to Aj^. 

We prove Theorem 5.1 in detail after the next theorem, which gives us the exponential rate for the 
convergence of the degree distribution. In order to reveal this rate, we further estimate lE(^) and 
E(-^^) on the right-hand side in Theorem 5.1 by using Proposition 3.3 and Corollary 3.4. 

5.2 Theorem 

For T > we have 

(iTy(MixPo(Ar), MixPo(M)) < as'/Te~ 2 ^'^ + 4aE(5) (t + ^ 

VA V P(P + A) 

where ag is the standard deviation of S. 
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5 Bounds on the total variation distance between the finite time and asymptotic degree distribution 


Proof: The inequality follows directly from Theorem 5.1, Proposition 3.3 and Corollary 3.4. □ 

5.3 Remark 

For the original model of Britton and Lindholm, which allows loops, we can adapt the proof of 
Theorem 5.1 in such a way that the upper bounds in Theorem 5.1 and Theorem 5.2 remain exactly 
the same. 

Proof of Theorem 5.1 

From Theorem 2.1 follows 


dTV'(MixPo(AT),MixPo(M)) < IE(|M - At|), 
and IE(|M — A^l) is smaller than or equal to 


E 




2=1 


i —1 l=i\/Jj' 


< -E 
- 13 


Yt 

E 

2=1 


S^ 

Yt-1 


_ -payPty 


(1-e 








Yt Yt-1 




i=l l=i 


Yt Yt-1 


E(5)(l - ^ ^(i^(e-/3A+i(T) _ 


2—1 l=i\/J'T' 

iYJr 


*5joc(l - 


(29) 


where we use the convention ^ := 0. 


For the first line of the right-hand side, we have 
Yt 

K 

Yt-1' d V 


E 


E 

2=1 

i^Jr 


Yt 


Si 


2=1 

iYJt 


Yt - 1 


(1-e 


-pAyn 




Yt, Jt 


iYJt 


(30) 


where the last equality holds since, given Yt, the age Ayt{T) and Jt are independent. 

Given Yt = ut > 1, the age Ay^(T) is the minimum of i/t — 1 i.i.d. truncated exponentially 
distributed random variables by Proposition 3.9. Since the minimum oi i/t — 1 independent Exp(A) 
distributed random variables is Exp((y'r — 1)A) distributed, the distribution of Ay^{T) is stochastically 


28 




5 Bounds on the total variation distance between the finite time and asymptotic degree distribution 


dominated by the Exp((y'r — 1)A) distribution. In general, we have for random variables X and Y 
with X <st Y that f{X) <st f(Y) and consequently IE(/(X)) < IE(/(y)) for every increasing function 
/. Thus the right-hand side of (30) is smaller than or equal to 


Note that 


E 


1 

Yt - 1 




^Pn{T) A ^ 1 / 

^ n — 1 ^ n — 1 V 

n=2 n=2 ^ 

A ^l/ Ae^^-A A" 
“ Xe^'^ — X ^ n I Ae^'^ ) 

n=l ^ ^ 


Ae^^ ) 


Pn{T) 

71 


= E 


A 

X^ 



Xe^^- 

Xe^^ 




n 


where Pn{T) is the probability mass function from Subsection 3.1. Thus the right-hand side of (30) is 
smaller than or equal to 

For the second line of the right-hand side of (29), we obtain 


E 


i=l l=i 


= E 


1=2 ‘ i=l 


= E E 


i=l 
i^Jt 

Yt-1 ^ I 


(e( ^ ^ (5i-E(5))(e-^^*+i(^)-e-^^'('^)) yr,Jr)) 


^-1 . , 
1=Jt\/2 I —1 

i^Jr 


E (5i - E(5)) |e ("e-^^*+i(1 _ i'^YX+i (T )))| 

\=J^V2 i=2 ^ ^ ^ 


< E 


1=JtY2 

Yt-1 


1= 
Yt-1 


E An E Er(5i - E(5))2E(1 - 

--JtV2 ^ ^ A=i V / 


< 


E^ V ^^o-sE(l - 

\=JtV2 1 ^ 


(31) 


where the fourth line holds since the second sum in the third line is (stochastically) independent 
of Jt, the social indices are independent of all other random variables and, given Yt, the index 
Jt is independent of the social indices and ages, and the second last line is obtained by applying 
EjZ - E(Z)\ < yVAA toZ=j^ Y!i=2 Si- 

Given Yt and A;+i(T), the difference Ai{T) — Aij^i{T) is the minimum of ^ — 1 i.i.d. truncated 
exponentially distributed random variables by Proposition 3.9, Corollary 3.10 and Remark 3.11 (for 
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5 Bounds on the total variation distance between the finite time and asymptotic degree distribution 


I > 2). Thus it is stochastically dominated by an Exp((/ — 1)A) distributed random variable Zi, where 
Zi can be assumed to independent of Ai^i{T). This implies for a > 0: 

FiAi - A+i < a\YT) = rnAi - A+i < a\YT,T- A+i = x)P(T - A+i G dxlYr) 

Jo 

> FiZi < a\YT). 


Thus, also given Yt alone, the difference Ai{T) — Ai^i(T) is stochastically dominated by Zi. By the 
same argument as for (30), it follows that (31) is smaller than or equal to 


ME 






ME 


1 /3 I 


<w(^ n 1 /g 2(^ - 1) 
- VE ^,/rri(;-i)A Yt 
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Since we arranged Aj^{T) < Aj^, for the third line of the right-hand side of (29) we obtain 

EE(,S)(l-e-^^‘^-)- ^ ^^(e-dA+iiT) _ 

i=l l=i\/Jj' 

i^Jr 

Yt-1 . I 

= E E(,S)(1 - -E(5) Y ]—f E 
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i^Jr 
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= EE(5)(l-e"^^‘'“)-E(5) Y (e"^^*+i/^/- 

1=JtV2 

= E E(S’)(1 - - E(5)(e"^^yT(^) - (jr,2)(T))) 

= E^E(S’)(1 - e-^^yT(^)) +E(5)(e"^^”“('^T.2)('^) - 
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= E(5)Eri 


I 3.9 


+ E(5)(E(e 


. 

By Proposition 

]g(g-/3^max(JT^,2)(r)) = / _^j^ dx = --^ 

Jo 1 — e 1 — e 

whence follows that (32) is smaller than or equal to 

Since we arranged Sjj, = Sj^ and Aj^, (T) < Aj ^, 
we have 

E 5j^(l - - 5j^(l-e"^^“-(‘^T,2)(^)) = 


Wx(jr.2)('r)) _]E(e-/5^-^oo)). (32) 


_ f e-(/3+A): 

or equal 


' dx = - --- 
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(33) 


1^ < 


iE(S)E(i)+E(S)^^ 

Aj^, for the fourth line of the right-hand side 


1' 


5joc(l - - 5j^(l-e-^^“-(‘^T,2)(^)) 

uie that the right-hand side of (34) is the 
12), which is smaller than or equal to 


of (29) 

E(5)(E(e 
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wx(jy,2)('r)^ _]£(g-/3Aj^^^^ (34) 

I 

; of (34) is the same as the second summand of the 
equal to E(5)^^p^T3Y above). 


; right-hand side of 


Altogether, we obtain the following upper bound for the right-hand side of (29); 

1 






□ 


5.2 The general case 

5.2.1 The main theorem 

Let Sj^{u}) = Sjj,{uj) as in the pure birth case. Furthermore, let Aj^ = Z, where Z is the random 
variable from Corollary 3.14. From the proof of this corollary, we see that 

^Joo(^) JT(<^)<yT(‘*^)}Eoo (E*(Ajy(T,cu)) + l|jy('^)_y,j,(^)j.Z(a;), 

where is the cumulative distribution function of the Exp(A) distribution as before, Z ~ FAo 
independent of everything and F^{t) = 1 — ^(see also (16)). Note, given > 0, we 
have Aj^ ~ Exp(A). 

Recall the definition of the parameter random variable Kt from (27) and let be a random variable 
with C{AXp) = C{Kt\Yt > 0) as before. Moreover, set 

M := ^^(1 - ) + ^^ii^(l - ) (35) 

B + B /3 + M 

and let M* be a random variable with £(M*) = C{M\Yt > 0). 

We already know that MixPo(A^) is the degree distribution at time T in the general case. Theo¬ 
rem 5.4 below implies that MixPo(A^) converges at a rate of just a bit over to the MixPo(M*) 

distribution, which is the asymptotic degree distribution stated in Section 3.2 of [BLIO], as T —)■ oo. 
Note again that this theorem is much more powerful since it gives an exact bound for the total variation 
distance for every finite T. 
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5.4 Theorem 

Let 0-5 < oo be the standard deviation of S. Then for T > ^ 


drv (MixPo (), MixPo (M*)) 


< a 


' f bVe A 


2 A-/i ' 5 

“ 1 “ cr/3 

+ 3V2aAcJ5r^e"(^"^)^ 


+ 7 + ( /? + I 
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)4 


E(5)T^e 


■ 2 p-(A-M)T 


Note that the right-hand side is of order 


0(r2)e"5A-A‘)'r 


as T —)■ 00 . 

The main idea of the proof of this theorem is the same as in the pure birth case; We make use of 
Theorem 2.1 to obtain E(|A 7 — M| \Yt > 0) as an upper bound for (MixPo(A^), MixPo(M*)) and 

establish a further bound for this expected value. In order to do so, we use that the expected value of 
the first summand of the right-hand side of (27) converges quickly to zero since the time T — Tbj,+Dt 
since the last event before T converges quickly to 0, and compare the remaining summand of the 
right-hand side of (27) with the right-hand side of (35). For this comparison, we make vital use of the 
fact that the average of the social indices of the nodes living at time T is close to £(5) by the Law 
of Large Numbers again and that, given T; for some large I G N, the percentage of the nodes living 
at time Ti that survive up to time T is approximately by the Law of Large Numbers (see 

Lemma 5.17). 

A further important ingredient is that the reciprocal of the node process (17)i>o conditioned on 
survival is a supermartingale (see Corollary 5.8), which makes it easy to deal with the expected value 
of its maximum. Finally, we also use that the age Ajj,(T) of the randomly picked individual converges 
quickly to Aj^ by Corollary 3.14. 

The fact that nodes may die complicates the procedure considerably since the population size after 
a fixed number of events is random in this case and additional dependencies have to be treated (e.g. 
the inter-event times depend on the random population size at the previous event time). 

In order to cope with additional dependencies, we introduce a deterministic number k{T) that 
depends on T such that the probability for more than k{T) events up to time T decreases exponentially 
in T (see Definition 5.9 and Lemma 5.10(ii) below). Essentially, we substitute the number of events 
Bt + Dt up to time T by k.(T) in the proof of Lemma 5.19 below since it is difficult to treat the 
dependencies between the number of events up to time T and the event times. 

In order to cope with additional dependencies on the random index Jt, we note that the probability 
that the node that is randomly picked at time T was born up to time ^ decreases exponentially in 
T (see Lemma 5.10(i)). Thus we essentially only have to consider the time interval [^, 00 ) instead of 
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the interval [T^(jy),oo), where T^(jy) is the birth time of the randomly picked individual. The choice 
of ^ as the left endpoint of the interval makes sure that we always have a large number of individuals 
in the time interval with high probability. 


5.2.2 Some lemmas of general interest 


Before we prove Theorem 5.4 in detail, we formulate several important statements that are used in the 
proof and could also be useful in other settings. The first one gives us an expression for the extinction 
probability given that the process survives up to time T. In order to simplify the notation, we define 
Too := lim Yt G {0,oo}. 

t^OO 

5.5 Lemma 

For the conditioned extinction probability given Tr > 0, we have 


P{Y^ = olTr > 0) = 

A 

Proof: By conditioning on the population size, we obtain 

P(yoo = OlTr > 0) = E(P(yoc = 0|yr)|yr > 0) = 


Yt 


Yt > 0 


(36) 


since (^/A)™ is the extinction probability of a linear birth and death process with birth rate A, death 
rate ^ and initial value m (see Remark 3.1). 

On the one hand, we have 


E 





= E 



Yt > 0 I P(yr > 0) + E 



Yt > 0 


{l-po{T))+po{T). 


P(yr = 0 ) 


On the other hand, we can make use of the known formula for the probability generating function of 
Yt (see III.5 in [AN72]) to obtain 


E 




If 

X' 


This yields 


E 



Yt > 0 


(^-po(r))(i-po(r))-i 


p{X-p) Xe^^ - p ^ ^ (a-m) 
A(AeA“^)'^ — fi) {X — ~ X 


□ 


For the probability of Tr = 1 conditioned on y^ > 0, we have the following lemma. 


5.6 Lemma 

We have for T > log(2) 


p(yr = i|yT > 0) < 


2(A - p) 
XeP-pT 
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Proof: Using the probability mass function p from Subsection 3.1, for T > log(2), we obtain 


P(yr = 1 |Ut > 0) 


PijT) 

l-Po(T) 


1 - Xp{T) 


< 2 (A - p) 


□ 


We next consider sub- and supermartingal properties of conditioned processes. 

5.7 Lemma 

(i) iYt)t>o conditioned on Foo = 0 is a supermartingale. 

(ii) iYt)t>o conditioned on Foo > 0 is a submartingale. 

Proof: 

(i) Consider a subcritical linear birth and death process {Yt)t>o with birth rate p, death rate A and 
initial value one. Then {Yt)t>o has the same law as (Yt)t>o conditioned on Fqo = 0 (see e.g. 
page 78 in [Lam08]), which can be proved by using the embedded generation process (see e.g. 
Section IV.3 of [AN72]) and the corresponding result for discrete-time branching processes (see 
e.g. Theorem 3 in Section 1.12 of [AN72]). It is well-known that a subcritical linear birth and 
death process is a submartingale, which yields the result. 

(ii) Consider a process {Yt)t>o that has the law of {Yt)t>o conditioned on Fqo > 0 . Note that (Fi)t>o 
inherits the Markov property from (17)t>o- Furthermore, it is well-known that (lt)i>o is a 
submartingale. Thus we obtain for t > s > 0 and G N 

E{Yt\Ys = Vs) = E{Yt\Ys = y.,Foo > 0) 

^ = Vs) 

P(Foo > OjF^ = ps) 

_ E(V|F, = Vs) - E(l{y^=o|F,|W = Vs) 

P(Foo > OlF^ = ps) 

^ Vs - P(Uoc = 0|n = VsMYtlYs = Ps) 

~ P(Foo > 0|Fs = Ps) 

^ Vs - P(Uoo = o|F^ = Ps)ys 

~ P(Foo > OlF^ = Ps) 

= Vs, 


where (lt)t>o is the supermartingale from (i), which also inherits the Markov property from 
(Yt)t>o- Thus {Yt)t>o conditioned on ultimate survival is a submartingale. □ 


Lemma 5.7 yields a useful result about (1^ ^)i>o conditioned on ultimate survival: 

5.8 Corollary 

(Fj“^)t>o conditioned on Fqo > 0 is a supermartingale. 
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Proof: In general, for a submartingale {Zt)t>o with respect to a filtration {Pt)t>o with Zt > I for all 
t > 0, we have for t > s > 0 


E 





< HZs 


Zt\J^s) < 0 . 


Thus {Z^ ^)t>o is a supermartingale. Consequently, Lemma 5.7 implies that {Y^ ^)t>o conditioned on 
ultimate survival is a supermartingale. □ 


Now we introduce the deterministic number k{T) as mentioned before, which helps us to cope with 
dependencies. Furthermore, we define the random number 1C{T) as index of the last event time before 
such that r’(Tr) > /C(T) is equivalent to and M.t as the sum of the number of births 

and the number of deaths to simplify notation. 

5.9 Definition 

Let k{T) := and /C(r) ;= max{A: : such that = Yr almost surely. 

Moreover, let Mt '■= Bt + Dt be the number of events up to time T. 

The following lemma implies the properties of k.{T) and JC{T) that were stated before and that we 
need to prove the convergence rate for the degree distribution in the general case. 

5.10 Lemma 


(i) For the probability that fewer than JC{T) events have occurred up to the birth time of the 
randomly picked node Jt was born, we have 


(r(Jr) < 1C{T)\Yt > 0) = | 




Tt > 0 

\ — fi 


\eP — A 


log 


X — ^ 


+ (A-m)T 


(ii) For T > a*)) ^ have 


P(K(r) < Mt\Yt > 0) < 

\X T'-j 


Proof: 

(i) By Remark 3.17, we have 

P(r(JT) < JCiT)\YT > 0) = P(T,(j^) < r^(T)|YT > 0) = f(t - | 


Yr > 0 


< E 


Yt- 1 1 

— - + 

Yt Yt 


XT 


Yr > 0 ) < e 2 ^^ + E 


Yr 


Yr > 0 . (37) 


By Proposition 3.3, the right-hand side of (37) is smaller than or equal to 




X — fi 


XeP — X 


log 


A 


X — fi 


+ {X-fi)T 
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(ii) Using Proposition 3.6, we compute 


t(r) - 2'K{Bt) > - 1 - 


2A 

X — IJ, 




X — 


= (^2^.T_ 2A ^ 3^ - A 


6 2 


X — ^ J X — n 


Since T > /^)) ^ have 


1 2/.r i(A+/.)r > 2A 


2 X — IJ, 

Thus for T > ^ ^j^g right-hand side of (38) is larger than or equal to 

e(A-A‘)rlg2/.Tgi(A+M)r 3^ - A 


X — II 


Since for T > ^(i°g(^A^^iog(A ^)) ^ have 


g(A-M)T > 1 + (A - ^)T > 1 + 2 log>1 + 2 log(4) > 2, 


(38) 


(39) 


(40) 


(41) 


inequality (39) implies that for T > ^('°g(^A^jog(A ^l)) ^ expression (40) is bounded from below 


by 


= (A-+t 3 O^T i(A-g+T , 3^ ^ ^ ^x+^l)T^ 

8 X — IJ 8 


Thus for T > ^(^°g(4A^^iog(A fi)) ^ have 


k{T) - 2E{Bt) > > 0. 


(42) 


Consequently, we can apply Chebyshev’s inequality and obtain 


P(k(T) < Mt\Yt > 0) < 


< 


P(k(T) < Mt) / P(K(r) < 2Bt) 

F{Yt > 0) - P(yr > 0) 

P(yr > 0) ^^^^^^ ~ 2E{Bt) < 2Bt - 2E{Bt)) 

1 4Var(i37’) 

P(yr > 0) (k(T) - 2E{Bt)Y ’ 


(43) 


where T > ^('°g(^A^^iog(A ^l)) ^ P(Tr > 0) > ^(+00 > 0) = Proposition 3.6 and (42), for 
T > ^('°g(^A+iog(A-/j)) ^ right-hand side of (43) is smaller than or equal to 


256 A Var(i?r) 30A / A^(A + n) ix+iAt , 2A^/i 2 (a-i-m)t 

9 X — Ii e3(A+/i)r “ X — (A — ij)^ (A — 11 )^ 


(44) 


Since e ^ by (41), the right-hand side of (44) is smaller than or equal to 

60A^(A + Ii) rx+ii)T 
(A - i^r 

□ 
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5.2.3 Further lemmas 


Now we give further statements that are used in order to prove Theorem 5.4. 

The following lemma states that the time T — T_Mj, since the last event becomes small quickly. 

5.11 Lemma 

For T > log(2) and c > 0, we have 


IE(1 - > 0) < ,^l + vE 


1 


-Ij 


Yt > 0 


' A^V^T-1 

Proof: Let X have the cumulative distribution function 

G{t) = l{v,>i}(l - 

Conditionally on Yt, we then have T — Tj^^, <st X by Theorem 3.19, which implies 
E(l-e"^('^"^'^T)|yr > 0) < E(1 - e"^^|yr > 0) 


(45) 


= E( (1 - e-^'^)F{X = TlTr) + E(1 - e-^'^jX < T,Yt)F(X < TITt) 


-cXi 


Yt > 0 


(46) 


Since P(X = TlTr = 1 ) = 1 and C{X\Yt) = Exp((yr — 1)A) on {Yt > 2 }, we obtain that the 
right-hand side of (46) is smaller than or equal to 


+ c+{^{-l)X^Vr>^ I U > O) 


Tr > 0 ) < E(yr = 1\Yt > 0) 

1 


+ tF 


For the conditional probability of Ft = 1, by Lemma 5.6, we have for T > jfrji log(2) 

2(A-m) 


Yt > 0 


E(yr = IITt > 0) < 


Ag(A fj.)T 


□ 


The following lemma gives us an upper bound for the conditional expectation on the right-hand side 
of (45) and is proved similarly to Proposition 3.3. 

5.12 Lemma 

We have 


E 


1 


Yt-1 


-1 




Yt >0 < 


A — /i 


Proof: We have 


E 


Yt-1 


-1 


{Tr>l} 


Yt>0] = 


XeP P)T — ^ 
X — fi 


log 


A 


\ — H 


+ {X-^x)T 


g 1/ Ae(^-^)^-A 


(47) 


Ae(A-/i)r _ ^ ^ ^ 

and in the proof of Proposition 3.3, we computed an upper bound for the series in (47) that yields 
that the right-hand side of (47) is smaller than or equal to 


X — fj, 


Ae ('*' GT' — 




log 


A 


X — jj. 


+ {X-^i)T . 


□ 


In order to prove our next lemma, we use the supermartingale from Corollary 5.8. 
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5.13 Lemma 

For all (5,7 > 0, we have 


E 


\ min 


l"oc >o] <e-^(^-^)^+Ef ^ 

IT 
2 


L„c > 0 


Proof: We obtain 

E^- ^ 


\ min 


> 01 < E ( E ( Y,-^<e-r(^-r)T} max ^ 

^ ^ ^<t '2- 


Yt 

2 


+ E ( E ( ^max 


Yt 

2 


1^00 > 0 


Yco>0 


fmaxl>e-i("-^)^ 

Yt ) 

Vi<i 



Yoo>0 


<e-7(A-M)T + Ej^^ 


where the last line follows from Corollary 5.8 and the the inequality (2) in Theorem 6.14 on page 99 
in [Yeh95]. □ 


We use the following lemma to bound the conditional expectation in the second summand of the 
right-hand side of Lemma 5.13 from above. 


5.14 Lemma 

We have for T > log(2) 


E( I Toe > 

I T 




Proof: For T > ^^^log(2), we obtain 


E 


Yt 

2 


Tx, > 0 = 


< 


E^Yj. ^l{y^> 0 }^ 


A 


< . E( Yy l|yj,>Qj j ^ 


nYoo >0) -A-// Vi 


< -^eI^ ^ 


X-12 \Yt 
2 


Yt > 0 
2 


< 


1 


- - 1 V VA - 


log( 


< 2 


where the second line follows from Proposition 3.3 and the last inequality holds for r>^iog(2 ). □ 

Before we continue to state and prove further useful statements, we introduce another random 
variable. 
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5.15 Definition 

Let Rti,t be the number of nodes that are alive at time T; and survive up to time T. 

The following lemma gives us the first two conditional moments of Rti,t- 

5.16 Lemma 

We have 

IE(i?T*,r|bTp7],ri+i) = {Yt, - 

-P(yT*+i = Yt, - 1\Yt^,Ti,Ti+i){2{Yt, - . 

Proof: Firstly, we determine the conditional expectation of Rti,t- 

^{Rti,t\Yti,Ti,Ti^i) = p'^E{Rti,t\Yti,Ti,Ti+i,Yti_^_^ = Yti + 1 ) 

+ P~E{Rti,t\Yti,Ti,Ti+i,Yti^^ = Yr^ - 1), (48) 

where p+ := F{Yt,^^ = Yt, + 1\Yt„Ti,Ti+,) and p- := P(yr,+i = - 1\Yt„Ti,Ti+,).^ 

With 

HRti,t\Yti,Ti,Ti+i,Yti^^ = Yti + 1 ) = E(iir,+i,T - 1sj,-_^^^^^^-^t^\Yti,Ti,Ti+i,Yti^^ = Yti + 1) 

= {Yti + (49) 

and 

E(i?ri,T|LrnF;,T/+i,yr,_^^ = Yt, - 1) = E{Rti_^_^,t\Yti,Ti,Ti+i,Yti_^_^ = Yti - 1). 

= {YT^ - l)e“^('^"'^*+i). 

Equation (48) implies 

E{Rt^,t\Yti,Ti,Ti+,) = yT,e-^(^-^'+i) - 

Secondly, we compute the conditional second moment of Rti,t' 

E(i?|,^|yT„rz,rz+i) =p+E(i?2^^^|yT„r;,rz+i,yT,^, = Yt, +1) 

+ P~E{R^^^j^\Yti,Ti,Ti+i,Yti^^ = Yti - 1). (50) 

We treat the summands separately again. For the case where a birth occurs at time T/+i, we have 

E(i?2^,^|yT„T;,T;+i,yT,^, = Yt, + 1) 

= E((i?T(+i,r - j,^f\YT,,TuTi+i,YTi^^ = Yt, + 1) 

= - 2Rti^^,t1{t-_^^^^^^>t} + Yt,,Ti,Ti+i,Yti^^ = Yt, + 1, Yt, > 0^ (51) 

^ We have and p~ = However, we do not need these explicit expressions for our purposes. 
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and further 


'E(-^Ti+i,rl^ri,7],r/+i,yr,+i = br* + 1) = Var(i?Ti+i,T|b"ri,7],r;+i,yri+i = br* + 1) 

+ = Yt, + 1))^ (52) 

Given yr^+i, let /iz+i be the set of the yrj+i nodes living at time Ti+i. Then by independence of 
various death times, we obtain for the conditional variance 


Var(i?T(+i,r|b'r,,7],Tz+i,yr,+i = br* + 1) 

= Yti + l,yri > 0^ 

= {Yti + 

For the second summand of (52), we obtain; 

E{Rt,^^,t\Yt,,TuTi+^,Yt,^^ = Yt, + 1) = {Yt, + 


= Var 


X] ^{T->T} 

r 


Thus (52) is equal to 


{Yti + 1) + yT,e-2Mr-T+i)^ _ 


For the remaining parts of (51), we have 


E(-RT,+i,rl{T- , >t}\^Ti,Ti,Ti+i,Yti^^ = Y^ + 1) 

= IF’(^,."i(/+i) > T\Yti,Ti,Ti+i,Yti^^ = Yti + 1) 

• E(-R7i+i,T|rTi(,+i) > T,Yti,Ti,Ti+i,Yti^^ = Yt^ + 1) 

= e-^(^-^*+i)(l + yr,e"^('^"'^*+i)) 


and 


E(i^^- ^^j|yT,,r;,Tz+i,yT,^, = Yt, +1) = 

^ r -*-(^ + 1) ^ 

Thus (51) implies 

E(i?2^,^|yT„rz,r;+i,yT,^, = Yt, + 1) 

= (Yti + 1) 


For the case where a death occurs at time T^+i, we have 


E(i?2^,r|yT,,r,,r,+i,yT,^, =yT, -1) 

= Var(i?T,+i,r|i"rpr^,Tz+i,yr,^, = Yt, - 1) + {E{Rt,^^,t\Yt^,TuTi+^,Yt,^^ = Yt, - 1))^ 

= {Yti - l)(e“^('^"'^*+i) - e-2M'r-T+i)^ _ y^2^-2^,{T-Tl+^) 
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= {Yti - - Wti + 2)e-2M'r-Ti+i)_ 

Thus from (50) follows 

+ p- ^2(1 - . 


Knowing the conditional moments of Rti,t, we can find an upper bound for a more complex conditional 
expectation involving Rti,t that appears in the proof of the main theorem below: 

5.17 Lemma 


For I E N, we have 




Proof: By Jensen’s inequality, we obtain 


e( ^t„Tz,Tz+i) 


- lTpT,,r,+i 


= 1 


PYt,>1}- 


_|_ g-2M(r-r;+i) 


Lemma 5.16 implies 


E{{RT,,T-^?\yp,Ti,Ti+^) 2 


(Yt, - 1)^ 


Yt,- 




Yt,- 






Yt.-I 


(Yti - i)2 ^{^Ti>BE((-^Ti,r - lf\YTi,Ti,Ti+i) 

= _ iy '^{YTi>n(^iR^i,T\YTi,Ti,Ti+i) - 2 E{Rti,t\Yti,Ti,Ti+i) + 

-P(yT,+, = Yt, - 1\Yt„Ti,Ti+i)(^2{Yt, - l)e-2Mr-^*+i) + 


Yt, - 1 


-l,(T-Ti+i) , {Yti>1} 

{Yt, - 1)2 
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< 


(Yt,- 




Yt,-1 




-/.(T-Ti+i) 


1 


+ 


{yT,>i} 


(Xt, - 1)2 

Yt,- 


_ ^^(-r-r,+i) , Yt^ 2M(r-r,+i) 

Yt,-1 (Yt, - 1)2 ^ Yt, - 1 >1}® 


1/y + 

Ft*-! ^ ^ (Ft,-1)2 


By (54) and (55), we can bound the right-hand side of (53) from above by 


(55) 


Yt,- 


-l{y,^>i} - l{y,^>i}e 2MT t,+i) +e 2^1^ Tm) j < 


< 


_ 6 _\ 2 


(56) 

□ 


The following purely analytical lemma is also used in the proof of Theorem 5.4. 


5.18 Lemma 

For all X > 0, we have 


^ ( —x)^ ^ x2 


k=2 


kl 


(57) 


Proof: Let x > 0. The left-hand side of (57) is equal to e * — 1 -|- x. For x = 0, this expression is 
equal to By 1 — e~^ < x, the derivative of the left-hand side of (57) is everywhere smaller than or 
equal to the derivative of the right-hand side of (57). Hence the statement follows. □ 


For the conditional expectation of the sum of the squared inter-event times since the birth of the 
randomly picked node, we have the following lemma. 

5.19 Lemma 

For T > (x^ log(2) V 2(l°g(^^Mog(-^-/^)) .^g 

( jYi'p — l 

E (Yi+i-TiY^ 

l=r{JT) 


Hr > 0 ) < e 


^ KnT-2LL±iil,-(A+c)T ,.2^.-1 




AT 


(58) 




A 


+ ;r2 + 


(59) 


2(A -|- /i) 


1 -h 2 log 


A — 
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Proof: For the left-hand side of (59), we deduce 
{Ti+^-TiY\Yt>0 

l=r{JT) 


Adj'—l 


< E 




Yt > 0 


+ > Q)T‘^ + > 0)T2. 


(60) 


Note that, given {YTf.)keN and JC(T), the inter-event times Tj^i — Tj are Exp((A -|- fi)Yj’j) distributed 
and independent for j > /C(T). In order to derive an upper bound for the first conditional expectation 
on the right-hand side of (60), we introduce a sequence of random variables {Uj)j^fq such that, given 
(^TjfceN and /C(T), Uj ~ Exp((A -h ^) min^(-r)<fc FrJ i-i.d. Then, given (iTjfceN and JC(T), we have 
Tj+i — Tj <st Uj for /C(T) < j < k{T) and obtain 


E( 1 


AAqp — l s 

{T.iT)>T}MTKiT)<Tw} ^ MTi>T(j^)}iTi+i - Ti) ^^^^^ma^^_^(T,+i - T,) Yt > oj 


< 


Eoo > 0 ) + ^P(Too = OlTr > 0). (61) 


By Lemma 5.5, the second summand is equal to 

4 A 

The first summand of (61) is bounded from above by 

-kIkI max Un 


T, 

2 


IC{T),{YT,)k>i 


AAo > 0 


< 


2 V Vi<i<«('r) 

1 


JCiT),{YT,)k>i 


AAo > 0 


T 

= -E 
2 


k{T) 


Ey 


(A -h fi) min Yt^ I 

K.(T)<k «=i 


Eoo > 0 , 


where the last equality follows from from the formula for the expectation of the maximum of i.i.d. 
exponentially distributed random variables (see e.g. the introduction of [Eis08]). 

Using the well-known upper bound for the harmonic sum yields 


T 

-E 

2 


1 


PT) 

_ V- 

(A-h|u) mm iTfc I 
K.(T)<k ^ «=i 


Too > 0 


< --E 


1 


2{\ + n) V min Yt^ 
K(T)<k 


Toe > 0 (log(Ac(r)) + 1) 


T ^ 
< —--E 


1 


2[\ + n) V min Yt^ 
K{T)<k 


Y^>0]{-{X + fi)T+l]. 


(62) 


43 




5 Bounds on the total variation distance between the finite time and asymptotic degree distribution 


For the conditional expectation in (62), we obtain for T > ^ 


E 


min Yt^ 
K{T)<k 


Ko > 0 = E 


1 


minlf 


< 1 + 2 log 


Yoo>0 

A 


X — ^ 




(63) 


where the last line follows from Lemma 5.13 with 5 = 1 and 1 = \ and Lemma 5.14. 

Thus we can conclude that the first summand of the right-hand side of (61) is smaller than or equal 
to 


3^2 T 

A - 

4 2(A -|- /i) 


1 + 21og(^-^^^ + (A - 


for sufficiently large T. 

For the last line of (60), we can use the upper bounds from Lemma 5.10 and obtain that for 
T > (x^ log(2) V ig smaller than or equal to 

QO0^j.2^-ix+,)T ^ ^ 2C^ (l°g(A^) + (^ - (64) 

Altogether, we obtain (59) for T > (^^log(2) V ), □ 


5.2.4 Proof of the main theorem 

In order to simplify notation, we introduce E*( ■) = E( ■ iTr > 0) and P*( ■) = P( ■ llr > 0). 


In the following we prove Theorem 5.4. As mentioned before, we use Theorem 2.1 to obtain 

(irv'(MixPo(A^),MixPo(M*)) < E*|At - M|. 

In order to find an upper bound for E^IA^ — M|, we use the triangle inequality for the absolute value 
after plugging in the definitions of A^ and M given by (27) and (35), which yields 

«(1 _ e-f^iT-TMr)) ^ 

IE* (Tr - l)/3 + SJt)'^{t->t} 

i^Jr 

Bj' ^ 

+ ^ ^ (-S'* + *S'jT)l{'7;->r} 

^ i=l ’ l=r 

- ^(E(S) +Sj„)(l- ) 

P I M 


(i)Vr(JT) 


This is bounded from above by 


E* 


a(l — e 

{Yt - l)/3 


Bt 




{T->T}^{>V>1} 


i=l 


(65) 
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'all^ 

+ IE*( — -Sj, Y. hT->T}MyT>i} 

i=l 
i^Jr 


{Yt - l)/3 


+ E=' 


+ E=' 


a 


Bt 


Aij'—l 


1 




Ai j- —1 


l=r(i)\/r{JT) 
r-^{l) 


^ I] (E(5) + ^JT)l{'7;->T}(e ^*+ 1 ^ - e Y 


l=r(JT) 


Yt,- 


2=1 

i^Jr 


a 


A^t~1 


+ E" 


fi + li 


a 


P + fi 


(E(5) + 5jy) Y 

l=r{JT) 

Jvi'j' — 1 

(E(5) + 5j^) Y 

l=r(JT) 


( 66 ) 

(67) 


( 68 ) 


(69) 


- ^(IE(5) + SjJ{l - , 

P -\- jl 

where Mt is the number of events up to time T and r~^{l) is the number of births that occur not later 
than the ^th event for all / G {1,..., Adr}, i-e. r~^ :{!,... ,A4t} —^ {1, • • • i—)■ Ylf=i l{r(i)<z}- 

In the following, we deduce upper bounds for (65)-(69). 


Upper bound for (65) and (66) 

We treat (65) similarly to the corresponding expression in the pure birth case, but condition on Bt, 
Dt, Jt and the information which nodes survive up to time T, and obtain 


E* 


a{l — e 

(Yt - l)/3 


Bt \ 

1 {W> 1 } E S^^^{T->T}) < ^E( 5 )r (1 - 

i^Jt 


(70) 


Since we condition on Jt, the same expression is also obtained for (66). 


Lemma 5.11 reveals an upper bound for the conditional expectation on the right-hand sides of (70): 
For T > log(2), we have 

Lemma 5.12 gives us an upper bound for the second summand of the upper bound from Lemma 
5.11: 

- AeA-/^)^ - /r ( A^) + ' 

Altogether, for T > log(2), we obtain that the expectation E*(l — jg gjx^aller than 

or equal to 

f (A-,, + ;s(log(3^) +(A-,.)T))e-<-^)-. (71) 
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Plugging this expression in the right-hand side of (70) results in the following upper bound for the 
sum of (65) and (66); 


4aIE(5)/ 


/^ + /3(log(^) +(A-^)r))e-(^-^)^. 


Upper bound for (67) 

Recall that Rti,t is the number of nodes that are alive at time T; and survive up to time T. We 
compute 


E* 


( Bj' 1 \ 

^ i=} l=r{i)Vr{JT) ^ 

. Mt-1 75 1 71 r 

^ ^ l=r(JT) *=1 

Rt. rr>2 


/ J^rp — \ 

<E* - ^ (e-/3(T-r,+i) _e-/3(T-T0)-Rr,,r-1^^ 


l=r{JT) 

Rti , t ^ 2 




•E 


1 


Rti,T — 1 


r-l(0 




2=1 

i^Jt 


{Yt)o<t<T, Jt 


(72) 


Note in the second line that we can restrict the sum to Rti,t > 2 because Rti,t > 1 by ^ > v^Jt) and 
a summand with Rti,t = 1 in the first line would be 0 anyway. 

Since the sum in the last line of (72) has exactly Rti,t — 1 summands of the form Si — E(S) and all 
other summands are zero, the right-hand side of (72) is smaller than or equal to 


E* 


j’ —1 

CX \ 

/=r(Jr) 

Rt, t^2 


(g-/9(r-r,+i) _ g-/3(T-Ti)) 


Rti,T — 1 

Er, -1 




0-5 


\/Rti,T - 1 


(73) 


Since e — e < /3(Ti+i — Ti) and Rti,t < Yti, the expression (73) is smaller than or 

equal to 


A4 j- —1 


(ys 


E* a y ^_ 


l{Yr^y,}{Ti+i-Ti)]<E*[a y 


A^t~1 


<ys 


Yn 


i=RJt) \IYt,- YR 


'^{YTi>i}iYi+i - 7 ])^ 


< V^aasT'E* 


max 




yr{JT)<k<MT-l ^/Y^J 

< YiaasTE' ^) 

+ YiaasTV . 

(74) 
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where /C(T) = max{/c : (see Definition 5.9). We have 

l{iC(T)<r(jT)! “ax 
V ^ ^ ^ r(JT)<k<MT-l 


< E( l{IC(T)<r(J'r)\ ™ax 

V ^ ^ ^ ^’K(JT)<k<MT-l 




Doo > 0^p*(yoc > 0) +1 • p*(yoo = 0), (75) 


where Yoo ;= lim Yt G {0,oo} as before. 

t^OO 


By Lemma 5.5, we have for the second summand of the right-hand side of (75) 




The first summand of the right-hand side of (75) is smaller than or equal to 


e( 


max 


KiTXk 


P'00 > 0 ) = E( 


., max —= 

V?<t VZ 


Too > 0 


By combining Lemma 5.13, where ^ \ and 7 = |, and Lemma 5.14, we obtain for T > log(2) 


El 


, max —= 


Too > 0^ < Too > 0^ 


< ' 1 -|- 2 log 


A- 




Too > 

^ +(A-;u)T^e"5(^-/^)^. 


(76) 


Thus for T > log(2), the first summand of the right-hand side of (74) is smaller than or equal to 

\/2a^cjsre"(^"^)^ + V2aas (^1 + 2 log + (A - fi)T^ (77) 

By Lemma 5.10(i), we obtain that for T > log(2), the second summand of the right-hand side of 

(74) is bounded from above by 




^j+(A-^)rje 


.-(a-m)t 


(78) 


Thus for T > log(2), the expression (67) is bounded from above by the sum of (77) and (78). 


Upper bound for (68) 

For (68), we have 


E" 


Mr-i r i(Z) 

■ .ri 


a 


= E" 


AAt —1 

{E{S) + Sj^) ^ (e-(/3+M)(T-T+i) _ g-(/3+/.)(r-r,)^ 

l=r(JT) 


(3 + fj, 

^(E,S, + Sg.) - e' 

l=r{JT) ' 
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aj3 




/3{/3 + fi) 


{E{S) + Sj^) 

l=r{JT) 


(79) 


since the second sum on the left-hand side of (79) has exactly Rti,t — 1 non-zero summands. The 
right-hand side of (79) is equal to 


E* 


a 


T' — 1 /' 

(E(S) + S*) + 

I _ ( \ T 


/?(/? -|- [l) 


l=r(JT) 


(80) 


Now we use 


and 


1 _ e-/3(T+i-T) ^ -Ti)-Y 


(-/3(T;+i-rO)' 


k=2 


k\ 


1 _ e-(/3+M)(T+i-T0 ^ -Ti)-Y 


k=2 


to obtain that (80) is smaller than or equal to 

Mj’ — V 


E" 


a 


^^(E(5) + Sj,) (/? + - Ti) 


-hE* 


a 


l=r(JT) 
A4j'~^ 


/?(/? -i- fi) 


{E{S) + Sj,) Y ~ 

Z=r(JT) ^ fc=2 


, -Rri,r — 1. 




-/3(T-T(+i) ^ (~/^(^Z-l-l ~ ^z))^ 


-FE* 


a 


/3 + fj, 


l=r{JT) 

A^t~1 oo 

{E{S) + Sj,) Y E 

l=r{JT) k=2 
A^t~1 oo 


k=2 

< E* (^a{E{S) + Sj,) Y l{T+i<r} (T^+i - T^)) 


A:! 


(-(/j + ^)(Tz+i-T0)^ 

k\ 


+ r(-(E(5) + 5j,) 




l=r( JT) k=2 


k\ 


(81) 


Firstly, we consider the first summand of the right-hand side of (81). Since the social index Sjj, is 
independent of all other random variables appearing in (81), this summand is equal to 


( OO 

E l{T+i<r} 

;_ f 7_^ 


We derive 


l=r(JT) 


\=t{Jt) 




m+i-Ti) 


Rti,T — 1 

Yt,-1 




m+i-Ti) 
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<E*( '^{Ti+i<T} 

^l=K(T)+l 

+ TF*{IC{T) > r(Jr)). 




m+i-Ti) 


(82) 


For log(2), the second summand of the right-hand side of (82) is smaller than or equal to 


Tc-^^t I 2 ( A -^) 

A 


log 


A 


X — fi 


^ +(A-/r)r^Te-(^-^)^ 


by Lemma 5.10(i). The first summand of the right-hand side of (82) is equal to 


E* 


l=KiT)+l 




m+i-Ti) 


JCiT),YT>0 


For the inner expectation, we have 


E 1 


‘-{T+i<T} 


- IE(l{Ti+i<T} 


Rti,T — 1 


m+i-Ti) 


IC{T),Yt>0 


Yt,-1 






m+i-Ti) 


1C{T),Yt, >0 


f{Yt, > o|/c(r)) 
p(yT > o|/c(r)) ■ 

(83) 


For the fraction, we obtain for I > JC{T) -|- 1 


p(yT, > o|/c(T)) ^ P(yr^(,) > o|/c(r)) 

P(yr>0|/C(T)) - E(p(yT>o|/c(T),yT^(^))|/c(r))- ^ ^ 

Note that, given IC{T) and we know that almost surely nodes are alive at time since a 

jump at this time has probability zero. Thus due to the Markov property for (lt)t>o and the formula 
for the extinction probability of a linear birth and death process with a general initial value given in 
Remark 3.1, the conditional probability P(yT > 0|/C(r), yr^^j.^) is equal to > which implies 

that (84) is equal to 


P(yr^(,^ > 0|/C(r)) 


< 


p(yT^(,, > o|/c(r)) 


Aez^-^ — p A 


E(1 - |/C(r)) (1 - Po{\T))F{Yt^^^^ 


> o|/c(r)) 

Thus the right-hand side of (83) is smaller than or equal to 
I RtuT — 1 


< 






Yt,-1 




{Ti+i-Ti) 


YT,,TulC{T)j 


JC{T),Yt, >0 


A 


X — p 
(85) 


By the Markov property of {Yt)t>o, we may omit the conditioning on JC{T) if we condition on 1^; and 
Ti for I > /C(r) -|- 1. Conditioning in addition on T^+i, we see that (85) is equal to 


E 




Rti,t — 1 


yr, -1 




— e 


-/.(T-T(+i) 


Yti,Ti,Ti+i 


/c(r),yT, >o) 


X — p 

( 86 ) 
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By Lemma 5.17, summing over I and taking the expectation yields the following upper bound for the 
first summand of the right-hand side of (82): 


_ t^rrr\ I 1 M \ 


JC{T),Yt,>0] . 


^l=K{T)+l 

Obviously, (87) is equal to 

^ ^ l=IC{T)+l 

\ / °° 


(87) 


E -mr, 

l=IC{T)+l 


IC{T),Yt, >0 
IC{T),Yt, >0 


We may conclude that for for T > log(2), the first summand of (81) is bounded from above by 


2aIE(5) ( V6-^EEl{y^>0} E - Ti)Y^^ ^ 

\ ^ \ 1=K{T)+1 ^ 


+[My^=o} E IE(1{t,+,<t}(T^+i-T0E^, 

^ ^ l=K.{T)+l 


1C{T),Yt, >0 
2 I }C{T),Yt, > 0 


A 




( 88 ) 


For the first outer expectation in (88), we have 




l=K{T)+l 


JC{T),Yt, >0 


\=K{T)+1 


/c(r) 


> 0 


< E 


LaxyElE( E Mt,^,<t}{Ti+i-Ti) JC{T)\ 


T f -1 

< —E( maxE ^ 

“ 2 V?<t * 


l=K{T)+l 

Ex, > 0 


Eoo > 0 


By inequality (76), we obtain that for T > log(2), this expression is smaller than or equal to 


1 + 21og(^^^^ + (A - 
For the second outer expectation in (88), we obtain 


T 

2 


E 


(i(y».o) E *:(i{T,+,<T>W+i-r,)yT7 

E 7_ ir('-r\ I 1 E 


l=]C(T)+l 


JCiT),YT, >0 


Ft > 0 


< |p(yoc = 0 |Et > 0) = 
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where the last equality follows from Lemma 5.5. 


In conclusion, for T > log(2), the first summand of the right-hand side of (81) is bounded from 


above by 


V6aj^E{S)T(^^l + 21og(^^^^ + (A - 


+ 2aE(5)(re-5^^ + ^iA^ 


Since the social index Sjj, is independent of all other random variables appearing in (81), the second 
summand of the right-hand side of (81) is smaller than or equal to 


2a 

P + n 


CO 

E(5)r( 

l=r[Jj') k=2 


(-(/3 + ^)(Tz+i-ro)' 


k\ 


By Lemma 5.18, this expression is bounded from above by 


a 


t3 + fi 


E(5)r(' Yl + 

\l _ ( T\ / / 


(89) 


^l=r{JT) 


For T > (y^ log(2) V Lemma 5.19 implies that (89), which is an upper bound 

for the second summand of the right-hand side of (81), is smaller than or equal to 

a(/3 + ^^)E{S) ( 

y A 4 (A - ^)^ 

For the third summand of the right-hand side of (81), we obviously obtain the same upper bound, 
but may replace the factor /3 + fihy f3. 


We may conclude that for T > (y^ log(2) V ^)) )^ the expression (68) is bounded from 

above by 

V6aj^E{S)T (l + 2 log + (A - fi)T^ 

+ 2aE(5) (^log + (A - ^i)T^ 

+ a(2/3 + fi)E{S) ( 

y A 4 (A — //)^ 
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,'3 . T 

-L -T^ -\ - 

^'4 ^2 (A + m) 


1 + 2 




Upper bound for (69) 

Recall that we arranged Sj^, = Sj^. Since the sum in (69) telescopes, this implies 


E* 


^(E{S) + Sj,) ^ (e-(/3++(r-r,+B_g-(/3+M)(T-TO) 


= E* 


l=r{JT) 

" (E(5) + 5j^)(l-e-(^+^)^-^-) 

(E(5) + Sj ) — (1 — ) 


/5 + + 

a 


< 


2a 


P + fj. 


(3 + fj, 

E(5) ^E* + E* ^ 




(90) 


where the last inequality holds since Sj^ is independent of all other random variables appearing in 
(90). 

By combining Lemma 5.11 and Lemma 5.12 as before, we obtain for the first conditional expectation 
on the right-hand side of (90) the upper bound 

1(a-m+(/3 + m) (log (x^) + (> - (‘)r) ) 

itr> jijlog(2). 

For the second conditional expectation on the right-hand side of (90), Corollary 3.14 implies 


E* 




< 


2A 2(A - ^i) 


/3 + /i + A 




ifr> jijlog(2). 

Altogether, we obtain that the right-hand side of (90) is smaller than or equal to 


4q; 


E(s)(4^ + 4^(iog( 


/I + + \ 


^ ^ + (A - ^i)T] + ^ 


X — fi 


/3 + /i + A 


-(\-lPT 


ifr> j^log(2). 


Conclusion 

Combining the upper bounds obtained for (65)-(69), we have for T > (y^ log(2) V ^ 

dTF(MixPo(A^), MixPo(M*)) 

<+++(,_, + /3(log(^) + (A - ,)t))o-(-oV 

+ V2ajasTe-^^->^^^ + V2aas (^1 + 2 log + (A - n)T^ 
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+ V2aasT\^ 2 ^'^ + + (A -/i)r^ 

\/6a-^^IE(5)r^(^l + 21og(^^^^ + (A - 


+ 2aIE(5)(re-i"^ + ^^^"^^ 


+ a(2/3 + m)E(5) 


■{« 


A 


r2 + 


T 


+ a(2/3 + ^)E(5)T- 


2(A + /i) 
J2(A-//) 


A 


4q! 

+ ^^E(5) 


/3 + ^ 

f \/2acJs + \/6( 


A — /i ^ /^ ~|~ A 


A A 
A 


log 

log 


A 


\ — fi 

A 


+ (A - m)t) + g-lAT^ 


+ (A — ^)T ] + 




a- 


X — fj, 


E(5) 


a 


+ -(2/3 + ^)E(5) -T + 


A + /i 


A-//; ^ j p + fi + xj 

1 + 21og^-^^^^ + (A - 

1 + 2 log( 1 ^) + (^ - 


+ (^\/2acJs + 2aE(5) + a(2/3 + ^)E(5)r^Te"^ 


-4at 


+ (aE(S)(^(A-M + /3(l + ^ 


+ 1 + 


2/3 + /i^ 


T (A-/.)r log 


X — fi 


+ (A-/r)T 


_)_ ■'^/g j _l_ (2/3 + fi)fi rj,2 _|_ 4(A — /i) ^4 A 


X — fi 


4A 


A(/3 + /i) /3 + /i/3 + A + /i 




+ eoa '^y ^ (2/3 + /x)E(5)T2e-(^+^)^ 

(A - /r)^ 


(91) 


In the remainder of the proof, we find a simpler upper bound by elementary calculations. In order to 
do so, we assume T > ^ —)1_ /pj^gn -^e have 


I + 2log( +{X-fi)T<^{X- fi)T, 

g-i(A-M)T ^ g-i(A-/.)rg-jL(A-^)T ^ 


12 


{X-fi)T 




12 


5 {X-fi)T 


g-i(A-/.)r 


g-i(A-M)T ^ g_i(A-M)rg-i(A-/.)r < ^-|g-i(A-/.)r < 2^-i(A-M)T 

“5 

and finally 2 < (A — /x)T. Thus the first three lines of the right-hand side of (91) are smaller than or 
equal to 


a 


5V6 A 


2 X-fi 5 


+ 7 + ( /? + 77 


229 2 

+ 


5(A — /i) X -\- fjj 


E(5) + ‘^V2as\ (A - M)T2e-i(^-^)^. 


(92) 
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Note that we have the following inequalities: 

, , A 

log 




1 + log 


A 


A — // 


+ (A — fi)T < 2(A — i^)T, 


1 1 


“ 2// r 


Thus the last four lines of (91) are smaller than or equal to 


a 


4(A-;u) 

/3A 


1+0^ 1 + 


/? + A 

P + n 


+ 1 + 


2(3 + 


T]{X- n)T]T]E{S) 


A A 


+ (^ + ^E(5)t 2 + V6^E(5)r 


+ . ^\ ns) + 30 -^'(^ + ^ ( 2/3 + ^)e{S)T + ^ 


A 


X{(3 + fi) 


fi{X - //)4 


/3 + /i/3 + A + /i 


E(5) 


(93) 


We first consider those terms in (93) that depend on E(5). Since 2 < (A — /u)r, grouping the terms 
where (3 dominates in the denominator, we have 

8aE(5) A — fi (x-ii)T , 4aAE(5') tx_^)T 6(A — fi) (x-ii)T 

(3 X ^/3{/3 + X + fi) ^ 


< 


< 


< 


aE{S) 


5aE(5) 

~1~ 


10(A - 


(A - 


(94) 


grouping the (3-bee terms in (93) yields 

aE{S) (12^^T + ^ ^2 ^ ^ ^ t] 

V ^ 4A X-fj. (X-n)^ J 

+ aE{S) 


< 12aE(5)A-^Te-(^-^)^ 
A 


A 


< aE(5) 


3(A —/u)^ fj? y/6X 15A^(A + ^) 


(mt + 1) ++ ^ + ^4^E^) r2e-<^-<-v 

(A-,)3 (95) 


and considering the terms in (93) where (3 dominates in the numerator, we obtain 


a, 


(3E{S)^T^e-^^->^^^ + 16a/3E(5)^^^AL^re-(^-^)^ + 6a/3E(5) M)^ ^3g-(A-M)T 

2A f-^\^ — ^ 

< a/3E(5) (^ + 2^2g-(A-M)T_ 


2A /i(A —/r)^ ' ^ A 

For the one term in (93) that depends on cr^, the inequality 2 < (A — fi)T implies 

, 3(A-^)2 2 


( 96 ) 


A/2(^r + 

A 


A 


< A/2f ~ 

y y 2A A 


asT e 
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2A 

< 3V2XasT^e-^^-f^'^^. (97) 

Combining (94)-(97), we obtain the following upper bound for the right-hand side of (93); 




«/? + T+ - + 7; 


4/37A ' PJ (A-^)3 


+ ( 6 T + ^ + |(/xr + 3) + 


S') 


E(5)r^e 


a-(\-p)T 


+ 3V2aXasT^e-^^-^^^. 


(98) 


The total upper bound for the theorem is the sum of (92) and (98) and this sum is obviously of the 
desired order. □ 


6 Negligibility of multiple edges 


In this section, we show that multiple edges are negligible (Lemma 6.1 below) and use this result to 
prove Corollary 1.4 from the introduction, which states our main result for the case where multiple 
edges are ignored. 


6.1 Lemma 

The probability that an individual picked uniformly at random at time T has at least one multiple 
edge given the number of nodes is positive at time T is of order 0 (T^e“ 6 (^“^)^) as T —)■ oo. 


Proof. Let Nt denote the degree of the randomly picked node Jt at time T, i.e. the number of edges 
that are incident to the node picked uniformly at random at time T. Given Nt, let pi < ... < pNj. 
be the birth times of these edges. Supposing we condition on Jt, Nt, pi < ... < pnt (lt)o<t<T 
and are at time pi, the probability that we do not create a multiple edge that stays a multiple edge 
up to time T is 

( 1 - 37 ^) %,,><+.)■ 

Thus we obtain for the probability that Jt has at least one multiple edge is 


^ ^ “ Yp., - 1 j^ib.2>3} 


^ “ y _i)^{T>3>4} • ••• 

-'P3 ^ ' 


Nt — 1 1 

1 - ^ - 1 


< E 1 - 1 - 


Nt — 1 


min It — 1 

T-Aj.^{T)<t<T 




1 


T-Aj^(T)<t<T 


{ min It —1 >A^t} 

Yx-r)T 


< F(Nt > > 0 ) -|- P( min 17 — 1 < es 

\T-Aj.^{T)<t<T 


y — 1 / 

^ pNj, ■*- / 

Yt > 0 

Yt > 0 


Yt > 0 


+ 1 - 1 - 




(99) 


Since the asymptotic degree distribution is MixPo(M) with M given by (35), we obtain by conditioning 
on M that the second moment E(N|.) is equal to 

2aE{S) 2aE((S + E(S)f) , , „ , .... 

+ T";—^—TT";— , ^ , —77 (cf. Subsection 3.3 m [BLIO]). 


A + /3 + /i (A + /3 + /i)(A + 2(/3 + p)) 
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Thus Theorem 5.4 and the Markov inequality imply 
F{Nt > > 0) < 


< 


2aE{S) 


+ 


2aE((5 + E(5))2 


A + /3 + /i (A + /3 + /i)(A + 2(/3 + /i)) 

For the second summand of the right-hand side of (99), we obtain 




min Yt — 1 < 
T-Aj^(T)<t<T 


Tr > 0 ) < P{ max y > ^ ^ —- 

^<t<T yt -I- 1 


Yt > 0 


+ p Aj^(r) < 


Yt > 0 . 


( 100 ) 


By Lemma 5.10, the second summand of the right-hand side is of order 0{e 2 !^ /^)^) as T —)■ 00 . 


With Too = lim Yt G {0, 00 }, we have 

t^OO 


1 1 

max — > —j-;——- 

y<t<T Yt _|_ I 


,Too > 0 


Yt >0 = 


< 


™ax ^ > 1 , — , 1^00 > 0 


Tr<t<T 


p(yr > 0) 


P( max ^ > 


\yr ^ ITT m 

y<t<T 


,A"oo > 0 


P(Too > 0) 


1 1 

< P max — > —j--^- 

\y<t<TYt g3('*'-A‘)'r _j_ 2 

Thus the hrst summand of the right-hand side of (100) is smaller than or equal to 

Toe > 0 ) +p(yoc = o|yT >0). 


yoo > 0 . 


1 1 

P( max — > ^r--^- 

~.y<t<TYt g3('*'-A‘)'r _|_ 2 


( 101 ) 


The second summand is smaller than or equal to Lg-(-^-/^)7’ j^y Lemma 5.5. By Corollary 5.8 and the 
inequality (2) in Theorem 6.14 on page 99 in [Yeh95], the hrst summand of (101) is bounded from 
above by 


E 


AAo > 0^ (e5(^-^)^ + 1) = 0(Te-5(^-^)^)(ei(^-^)^ + 1) = 0(Te-^(^-^)^), 


where the hrst equality follows from Lemma 5.14. 

In order to obtain an upper bound for the third summand on the right-hand side of (99), we apply 
the following general lemma, which can be proved by elementary calculations; cf. [AE06], proof of 
Theorem 6.23. 

6.2 Lemma 

Let (2:n)neN be a non-negative sequence. Then for all n G N with ^ < 1, we have 

2 n 


g-^n _ 



V nj 
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The third summand of the right-hand side of (99) is smaller than or equal to 


1 - 1 - 




( 102 ) 


where [■] denotes the ceiling function. Let T > Then (102) is bounded from above by 


1-11- -11 A ^ _ 1 


- 1]4 


pg^(A-M)r _ 


< 1 — exp( — 


1 


+ 


exp 


- 


V (eT2(^-A‘T-l]2 

1 


- 1 - 


1 


- 1]3 


|-g(A-M)T/12_]^-| 


V _ lY 

Thus by Lemma 6.2, the right-hand side of (103) is bounded from above by 
1 e 4 4e 


(103) 




< 




l-gl^lA-Atlr - 1-|2 - 1]3 gi(-^-/^)r 


= 0(e"^(^"^)^). 


Altogether, we obtain that the probability that Jt has at least one multiple edge is of the order 

o(r2e-^(^-^)^). □ 


Proof of Corollary 1.4 

Recall that ut denotes the distribution of the number of neighbours, I't the degree distribution at time 
t and u the asymptotic degree distribution. Lemma 6.1 implies that dTv{^t-,T^t) = as 

t —7- oo. From Theorem 1.2, we know that dTvi^t,^) = as t ^ oo. Thus the triangle 

inequality yields the desired result. □ 
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